Abstract

I analyse the conceptual and mathematical foundations of Lagrangian quantum
field theory (that is, the 'naive' quantum field theory used in mainstream physics,
as opposed to algebraic quantum field theory). The objective is to see whether
Lagrangian quantum field theory has a sufficiently firm conceptual and mathematical
basis to be a legitimate object of foundational study, or whether it is too ill-defined.
The analysis covers renormalisation and infinities, inequivalent representations, and
the concept of localised states; the conclusion is that Lagrangian QFT (at least as
described here) is a perfectly respectable physical theory, albeit somewhat different
in certain respects from most of those studied in foundational work.

1 Introduction 

From its beginning, quantum field theory (QFT) has been plagued with mathematical
difficulties. Any attempt to apply the theory to interacting systems led to the appearance
of infinities, and although methods were found[1] to remove those infinities from the
calculated cross-sections by cutting off certain integrals at finite (though very high) energies,
these methods had a sufficiently ad hoc feel that the theory was not felt to be on remotely
satisfactory conceptual ground.

From the 1950s onwards, there has been a major program to resolve this problem
by reformulating QFT on an axiomatic basis: that is, starting from what seem to be
physically necessary (and mathematically precise) principles which any QFT would
have to satisfy, and then finding QFTs which actually satisfy them. This program is now
generally referred to as algebraic quantum field theory, or AQFT; see Haag (1996), and
references therein, for extensive discussion of it.



[1] By Dyson, Feynman, Schwinger and Tomonaga, amongst others; see Cao (1997, pp. 185-209) for
a historical discussion.

The major problem with AQFT is that very few concrete theories have been found
which satisfy the AQFT axioms. To be precise, the only known theories which do satisfy
the axioms are interaction-free: no examples are known of AQFT-compatible interacting
field theories, and in particular the Standard Model cannot at present be made
AQFT-compatible.

Nonetheless, the great majority of foundational discussions of QFT are carried out in 
an AQFT framework, for obvious reasons: the precise axiomatic nature of AQFT makes
it ideal for foundational study, since we can specify with mathematical precision exactly 
what the entity is that we are studying. 

Meanwhile, the mainstream physics community has continued to work with what we 
might call (for want of a better name) "Lagrangian" QFT, the original QFT developed in 
the 1930s and whose infinities were tamed (at least pragmatically speaking) in the 1950s. 
In Lagrangian QFT, an entirely different program exists for understanding those infinities, 
in terms of the renormalisation group; this program, due primarily to Wilson, does not 
eliminate the finite cutoffs introduced to deal with the infinities, but rather attempts to 
'legitimize' them. It is in the framework of Lagrangian QFT that the Standard Model is 
formulated. 

This paper is an investigation of whether Lagrangian QFT is sufficiently well-defined 
conceptually and mathematically that it too can be usefully subjected to foundational 
analysis. The reasons for making such an investigation are threefold. Firstly, the problem 
with restricting our foundational studies to AQFT is that — pending the discovery of a 
realistic interacting AQFT — we have only limited reason to trust that our results apply 
to the actual world, which appears to be described rather well by the Standard Model. 

One response to this criticism is to say that Lagrangian QFT is so mathematically 
ill-defined that we cannot regard it as a proper theory at all, so we cannot trust its results 
either! Our second reason for the investigation is then to see whether this criticism is 
justified. 

The third motivation is somewhat more philosophical. Wilson's explanation of the 
renormalisation procedure relies upon the failure of the QFT to which it is applied at 
very short distances. It is then intriguing to ask how to put on a firm conceptual footing 
a theory which relies for its mathematical consistency on its own eventual failure. 

The structure of the paper is as follows. Section 2 reviews the formal definition of
a Lagrangian QFT. This definition is very formal and mathematically totally ill-defined;
sections 3 and 4 address this problem, dealing respectively with renormalisation and with
the existence of inequivalent representations of the observable algebra. Section 5 considers
the problem of how to define local objects in QFT. This is also a problem in AQFT and
much of this section's argument applies in that domain; however, the framework of sections
2-4 offers some interesting insights. Section 6 is the conclusion.

Three comments should be made before the main part of the paper:

1. This paper is not intended as an attack on the AQFT program. The program to
construct an AQFT-compatible interacting field theory is well-motivated and, if suc- 
cessful, would be of enormous import; the foundational results which have emerged 
from AQFT have been of considerable importance in understanding quantum field 
theory and in general they apply also to Lagrangian QFTs. This paper should be 
read as complementary to, rather than in competition with, these results. 

2. There is a strong tradition in foundational studies of QFT to treat the theory from 
an operational viewpoint, formalised by regarding the local algebras of AQFT as 
describing operations which can be carried out by an observer localised in a given 
region. This paper does not follow that viewpoint: it treats QFT instead as a closed
theory (a 'universal theory' in Deutsch's (1985) sense), and regards measurement
devices, observers etc. as just subsystems of the QFT state. This difference of
approach has only limited significance for most of the paper but is important in
section 5.



3. Although much of the discussion below applies equally to bosonic and fermionic 
field theories, the specific examples discussed are exclusively bosonic. 



2 What is a Lagrangian QFT? 

For better or for worse, most canonical quantum field theories are found by starting with a 
classical field theory and then 'quantizing' it. To be sure, there is something intellectually 
unsatisfactory about this: given that quantum theory is the more fundamental theory, 
we would prefer to work in the other direction, that is, to recover classical field theories 
from quantum starting points (see Deutsch (1984) for a development of this criticism).
Nonetheless, the classical starting point has proven a powerful method for finding QFTs,
and we adopt it here.



2.1 Classical field theories 

A classical relativistic field theory can be considered as consisting of a relativistic
spacetime $\mathcal{M}$ (such as Minkowski spacetime), a set of fields (that is, maps from $\mathcal{M}$ to some
other space, such as the real or complex numbers[2]), and a Lagrangian density: a
real-valued function $\mathcal{L}$ on $\mathcal{M}$ whose value at a given point $x$ depends only on the fields and
their first partial derivatives, evaluated at $x$. For a given region $\mathcal{D}$ of $\mathcal{M}$, we define the
action $S_{\mathcal{D}}$ as the integral of $\mathcal{L}$ over $\mathcal{D}$; $S_{\mathcal{D}}$ is thus a functional of the fields, and we define

[2] In more advanced treatments, we might take the fields to be sections of some fibre bundle over $\mathcal{M}$;
this technicality does not significantly affect the arguments of the paper, and will be ignored.

the dynamically allowed field configurations within $\mathcal{D}$ to be those for which $S_{\mathcal{D}}$ is extremal
under variations of the fields which vanish at the boundary of $\mathcal{D}$.

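To go further, it is necessary to take $\mathcal{M}$ to be globally hyperbolic, and then to fix a
foliation $\mathcal{M} = \Sigma \times \mathbb{R}$. Let $\Sigma_1$, $\Sigma_2$ be any two spatial slices in the foliation, with associated
time coordinates $t_1$, $t_2$ respectively; let the spacetime region between the two slices be
$\mathcal{D}_{12}$. If the fields are required to fall off rapidly enough at spatial infinity, then we can
define the action $S_{\mathcal{D}_{12}}$:

$$S_{\mathcal{D}_{12}} = \int_{\mathcal{D}_{12}} \mathrm{d}^4\mu \, \mathcal{L} = \int_{t_1}^{t_2} \mathrm{d}t \int_{\Sigma} \mathrm{d}^3\mu(t) \, \mathcal{L}, \tag{1}$$

where $\mathrm{d}^4\mu$ is the four-dimensional volume element on $\mathcal{M}$ and $\mathrm{d}^3\mu(t)$ is the induced
three-dimensional volume element on $\Sigma$. As the notation indicates, the three-volume element is
in general time-dependent; however, when $\mathcal{M}$ has a time-translation symmetry then we
can always choose a foliation such that this time-dependence vanishes.
We can now define the Lagrangian $L(t)$ of the theory by

$$L(t) = \int_{\Sigma} \mathrm{d}^3\mu(t) \, \mathcal{L}. \tag{2}$$

$L$ is now a functional of the fields on $\Sigma$ and their time-derivatives, and the action $S_{\mathcal{D}_{12}}$ is
given simply by

$$S_{\mathcal{D}_{12}} = \int_{t_1}^{t_2} \mathrm{d}t \, L(t), \tag{3}$$

so the field theory is now in Lagrangian form (albeit infinite-dimensional).

The final step before quantisation is to transform from Lagrangian to Hamiltonian
form, which we do by a straightforward infinite-dimensional generalisation of the Legendre
transform in classical mechanics: to each field $\phi_i$ we associate a conjugate momentum field
$\pi_i$ by

$$\pi_i(\mathbf{x}) = \frac{\delta L}{\delta \dot{\phi}_i(\mathbf{x})}, \tag{4}$$

and we define the Hamiltonian $H$ by

$$H = \sum_i \int_{\Sigma} \mathrm{d}^3x \, \pi_i(\mathbf{x}) \dot{\phi}_i(\mathbf{x}) - L, \tag{5}$$

where $\dot{\phi}_i$ is defined implicitly in terms of the $\pi_i$, via (4). Points in the phase space $V$ of
the field are then given by specifying sets of functions $(\phi_i, \pi_i)$ (for all $i$); it is easy to check
that the dynamics are given by Hamilton's equations, as in the finite-dimensional case:

$$\dot{\phi}_i(\mathbf{x}) = \frac{\delta H}{\delta \pi_i(\mathbf{x})}; \tag{6}$$

$$\dot{\pi}_i(\mathbf{x}) = -\frac{\delta H}{\delta \phi_i(\mathbf{x})}. \tag{7}$$

The Poisson bracket on $V$ is given by

$$\{A[\phi, \pi], B[\phi, \pi]\} = \sum_i \int \mathrm{d}^3x \left( \frac{\delta A}{\delta \phi_i(\mathbf{x})} \frac{\delta B}{\delta \pi_i(\mathbf{x})} - \frac{\delta A}{\delta \pi_i(\mathbf{x})} \frac{\delta B}{\delta \phi_i(\mathbf{x})} \right), \tag{8}$$

and $\phi_i$ and $\pi_i$ obey natural infinite-dimensional analogues of the canonical relations:
$\{\phi_i(\mathbf{x}), \phi_j(\mathbf{y})\} = \{\pi_i(\mathbf{x}), \pi_j(\mathbf{y})\} = 0$ and $\{\phi_i(\mathbf{x}), \pi_j(\mathbf{y})\} = \delta_{ij}\,\delta(\mathbf{x} - \mathbf{y})$.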
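
As an illustration of the Legendre transform and Hamilton's equations described above, consider a single free real scalar field $\phi$ of mass $m$ on Minkowski spacetime. The choice of theory, the metric signature $(+,-,-,-)$, units with $c = 1$, and the flat foliation with $\mathrm{d}^3\mu(t) = \mathrm{d}^3x$ are all assumptions made here for concreteness; none of them is singled out by the text. Under those assumptions a sketch of the construction runs:

```latex
\begin{align*}
\mathcal{L} &= \tfrac{1}{2}\,\partial_\mu\phi\,\partial^\mu\phi - \tfrac{1}{2}\,m^2\phi^2
             = \tfrac{1}{2}\,\dot{\phi}^2 - \tfrac{1}{2}\,(\nabla\phi)^2 - \tfrac{1}{2}\,m^2\phi^2, \\
L(t) &= \int \mathrm{d}^3x \left[ \tfrac{1}{2}\,\dot{\phi}^2 - \tfrac{1}{2}\,(\nabla\phi)^2 - \tfrac{1}{2}\,m^2\phi^2 \right], \\
\pi(\mathbf{x}) &= \frac{\delta L}{\delta \dot{\phi}(\mathbf{x})} = \dot{\phi}(\mathbf{x}), \\
H &= \int \mathrm{d}^3x \, \pi\dot{\phi} - L
   = \int \mathrm{d}^3x \left[ \tfrac{1}{2}\,\pi^2 + \tfrac{1}{2}\,(\nabla\phi)^2 + \tfrac{1}{2}\,m^2\phi^2 \right], \\
\dot{\phi} &= \frac{\delta H}{\delta \pi} = \pi, \qquad
\dot{\pi} = -\frac{\delta H}{\delta \phi} = \nabla^2\phi - m^2\phi .
\end{align*}
```

Eliminating $\pi$ gives $\ddot{\phi} - \nabla^2\phi + m^2\phi = 0$, the Klein-Gordon equation, which is also what extremising the action yields directly. The integration by parts used to obtain $\delta H/\delta\phi$ relies on the fall-off of the fields at spatial infinity.
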
Before going on to quantise this theory, we make two observations: 

1. Despite the somewhat casual approach of this section, classical field theories have 
perfectly well-defined Lagrangian and Hamiltonian forms, and the above analysis 
can be carried out in a fully rigorous way; see chapter 7 of Woodhouse (1991),
chapter 3 of Marsden and Ratiu (1994) and references therein for such an analysis.
In this more rigorous approach it is necessary to replace the point functionals $\phi_i(\mathbf{x})$
and $\pi_i(\mathbf{x})$ with smoother functionals on phase space, and to eschew use of the
delta-function Poisson bracket between such point functionals; such objects have a status
not dissimilar to that of position eigenstates in non-relativistic quantum mechanics, 
in that they cannot easily be rigorously defined but (if used with a little caution) 
are of great conceptual and calculational convenience. 

2. The almost immediate introduction of a foliation of $\mathcal{M}$ may cause some readers to
worry about violation of relativistic covariance. However, recall that each point in
V defines a unique trajectory through V, so that we may set up an isomorphism 
between phase-space and the space of all solutions to the field equations. The latter 
space is covariant (in the sense that it can be defined without use of any preferred 
foliation) and if desired all talk of 'phase space' may be reinterpreted as talk of 
solution space. 

Both of these points are developed further in Wald (1994) (from an AQFT perspective,
and in the somewhat restricted context of free fields). 

For convenience, in the remainder of this paper we assume that we are dealing with a 
single real field, and so drop the subscripts on $\phi$ and $\pi$; this has no significant consequences.



2.2 Quantization: observables 

In an ideal world, quantization would work as follows: it would yield a Hilbert space $\mathcal{H}$,
together with a map $Q$ from the functions on $V$ (which can be thought of as the classical
observables) into the space of self-adjoint operators on $\mathcal{H}$. $Q$ would have the property
that, if $\{\cdot,\cdot\}$ is the classical Poisson bracket, then $Q(\{A, B\}) = -i\,(Q(A)Q(B) - Q(B)Q(A))$.

This is not an ideal world: quantization as defined above is provably impossible (see,
e.g., Abraham and Marsden (1978) for a discussion). What is possible is to find such a
$Q$ for a much more restricted class of observables: in nonrelativistic particle mechanics,
for instance, we choose spatial position and the momentum conjugate to it. There is
no known algorithm for choosing this restricted class, and no real reason to expect one
to exist (cf. the criticisms of quantization in Deutsch 1984); two different choices will
sometimes lead to empirically inequivalent quantum theories, in which case experiment
is the only way to determine which is correct.[3] In general, however, Nature is fairly
kind to us, and making the most obvious choice of observables tends to work. QFT is
no exception: choosing $\phi(\mathbf{x})$ and $\pi(\mathbf{x})$ as preferred observables leads to the empirically
correct theory.

The question of which observables count as 'fundamental' is also relevant for the 
interpretation of the quantum theory which we hope to produce. For any fixed dimension, 
all Hilbert spaces are isomorphic, so it is through the observables that we are able to give 
physical meaning to a theory. In practice, this usually involves a connection between
certain observables and the spacetime of the theory: in NRQM, for instance, it is the
designation of the momentum operators as generators of translations which allows us to
identify certain states as eigenstates of position.[4] The obvious strategy in QFT is to use
the spatiotemporal dependence of the observables to give the theory physical meaning:
when quantised, $\phi(\mathbf{x})$ and $\pi(\mathbf{x})$ will become operators which we will treat as localised at
$\mathbf{x}$.

This approach can be taken further. First we shift from the Schrödinger to the Heisenberg
picture, so that the field operators are functions of spacetime points $x$ rather than
just spatial points $\mathbf{x}$. Then, given any (bounded, open) subset $O$ of $\mathcal{M}$, we define the
algebra $\mathcal{A}(O)$ as the algebra of all operators which can be constructed from $\hat{\phi}(x)$ and
$\hat{\pi}(x)$ whenever $x \in O$, so $\mathcal{A}(O)$ includes such operators as $\hat{\pi}(x)^2$ ($x \in O$), $\int f(x)\hat{\phi}(x)$, etc.
This gives us (formally) what is technically known as a net of operator algebras: a map
from bounded, open subsets of a topological space into operator algebras, such that if
$O_1 \subseteq O_2$, then $\mathcal{A}(O_1)$ is a subalgebra of $\mathcal{A}(O_2)$. It is a central claim of algebraic quantum
field theory (and one which appears consistent with the way QFT is used in physics)
that a QFT is entirely specified by the structure of this algebra net: in other words,
that all physical facts about the theory can be determined from knowing the spacetime

[3] Carlip (1998) points out (p. 97) that 'old quantum theory' can be (somewhat anachronistically)
understood as wrongly selecting action-angle variables as the preferred observables.

[4] The argument, adapted from Newton and Wigner (1949), is as follows (in one dimension): if a state
$|\phi\rangle$ is localised at some point, then any translation $\exp(i\lambda P)$ applied to it will leave it localised at a
different point; thus $\exp(i\lambda P)|\phi\rangle$ must be orthogonal to $|\phi\rangle$; hence the position eigenstates are precisely
those satisfying $\langle\phi|\exp(i\lambda P)|\phi\rangle = 0$ for all $\lambda \neq 0$. If $\{|k\rangle\}$ are the (improper) eigenstates of $P$, and if we define
$|x\rangle = \int \mathrm{d}k \, \exp(-ikx)|k\rangle$, then $\langle x|y\rangle = \delta(x - y)$, and $\exp(i\lambda P)|x\rangle = |x - \lambda\rangle$; hence $\langle x|\exp(i\lambda P)|x\rangle =
\langle x|x - \lambda\rangle$, and so the $|x\rangle$ states are exactly localised.

dependence of the observables, without any need to know (for instance) which observable
is $\hat{\pi}(x)$ and which is $\hat{\phi}(x)$. In particular, this allows the possibility that field theories
generated from different classical Lagrangians are actually the same QFT; two such field 
theories are generally referred to as Borchers equivalent. 

As such, to specify an algebraic quantum field theory, all that is needed is a net of 
algebras, which is required to satisfy certain axioms (basically relating to locality and 
causality; see Haag (1996) for an extended discussion of these axioms). Any such AQFT
implicitly specifies a Borchers equivalence class of 'concrete' QFTs each of which generates
the same algebra net; all such concrete QFTs are Borchers equivalent to one another, and
they are usually regarded simply as different ways of describing the AQFT, analogous to 
different coordinate systems on a manifold. 

The assumption that the net of algebras captures all physical information about a 
theory gives a very elegant realisation of the idea that quantum theories gain physical 
content via a specification of the spatiotemporal properties of their observables. We will 
refer to it as the Net Assumption. 

We have glossed one question, though: what does it mean to say that an operator is 
localised somewhere? In much of the AQFT literature an operationalist answer is given 
to this question: operators represent physical operations which can be performed, by 
the observer, on the QFT state, and in particular $\mathcal{A}(O)$ is the algebra of all operations
which can be performed, and all measurements which can be made, by an observer who
is external to the field system but localised within $O$. However, as mentioned in the
Introduction, we wish to understand a given QFT as self-contained (in which case
observers, measurement devices and the like must be built out of states of the QFT), so
this approach is not available to us. Instead, we take the expectation values of operators
in a given spatial region as giving us information about the degree of excitation of the 
field in that region. This view will be further developed in the next section. 



2.3 Quantization: states 

We now address the practical task of actually finding the operator representations of the 
field observables. In NRQM we usually do this via wave-functions: we take our Hilbert 
space to be the space of complex functions $\psi(q)$ on the configuration space of the classical
theory, and quantize the classical observables $q$, $p$ via

$$\hat{q}\psi(q) = q\psi(q); \qquad \hat{p}\psi(q) = -i\frac{\partial\psi}{\partial q}. \tag{9}$$

Formally (and only formally; but see sections 3 and 4) we can easily generalise this
to infinite-dimensional systems: the configuration space of the system is the
infinite-dimensional space of all field configurations on $\Sigma$, so the quantum states will be
complex-valued functionals on that space. We will denote the Hilbert space of such functionals by
$\mathcal{H}_\Sigma$, and proceed to quantize the classical observables $\phi(\mathbf{x})$ and $\pi(\mathbf{x})$ as

$$(\hat{\phi}(\mathbf{x})\Psi)[\chi] = \chi(\mathbf{x})\Psi[\chi]; \tag{10}$$

$$(\hat{\pi}(\mathbf{x})\Psi)[\chi] = -i\frac{\delta\Psi[\chi]}{\delta\chi(\mathbf{x})}, \tag{11}$$

where $\chi$ is any field configuration on $\Sigma$. It is easy to check that infinite-dimensional
generalisations of the canonical commutation relations are satisfied:

$$[\hat{\phi}(\mathbf{x}), \hat{\phi}(\mathbf{y})] = [\hat{\pi}(\mathbf{x}), \hat{\pi}(\mathbf{y})] = 0; \tag{12}$$

$$[\hat{\phi}(\mathbf{x}), \hat{\pi}(\mathbf{y})] = i\,\delta(\mathbf{x} - \mathbf{y}). \tag{13}$$

(For simplicity we have considered only the case of one real field, but the generalisation
to multiple fields is obvious.)
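
The check of the field-momentum commutator can be sketched explicitly. The sketch assumes, by analogy with the finite-dimensional rule (9), that the momentum operator acts as $-i$ times a functional derivative; that sign convention and the arbitrary test functional $\Psi$ are assumptions of the sketch. It uses only the identity $\delta\chi(\mathbf{x})/\delta\chi(\mathbf{y}) = \delta(\mathbf{x} - \mathbf{y})$:

```latex
\begin{align*}
\big([\hat{\phi}(\mathbf{x}), \hat{\pi}(\mathbf{y})]\Psi\big)[\chi]
 &= \chi(\mathbf{x})\left(-i\,\frac{\delta\Psi[\chi]}{\delta\chi(\mathbf{y})}\right)
    + i\,\frac{\delta}{\delta\chi(\mathbf{y})}\Big(\chi(\mathbf{x})\,\Psi[\chi]\Big) \\
 &= -i\,\chi(\mathbf{x})\,\frac{\delta\Psi[\chi]}{\delta\chi(\mathbf{y})}
    + i\,\delta(\mathbf{x}-\mathbf{y})\,\Psi[\chi]
    + i\,\chi(\mathbf{x})\,\frac{\delta\Psi[\chi]}{\delta\chi(\mathbf{y})} \\
 &= i\,\delta(\mathbf{x}-\mathbf{y})\,\Psi[\chi].
\end{align*}
```

The remaining commutators vanish for simpler reasons: two multiplication operators always commute, and two functional derivatives commute because mixed functional derivatives are symmetric.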

Now the Hilbert space $\mathcal{H}_\Sigma$ has a natural tensor-product structure. Let $\Sigma_1, \ldots, \Sigma_n$ be
disjoint subsets of $\Sigma$ whose union is $\Sigma$; then (formally) we can define $\mathcal{H}_{\Sigma_i}$ as the space of
complex functionals on functions on $\Sigma_i$. Any function $f$ on $\Sigma$ can be specified uniquely
by giving those $n$ functions which are the restrictions of $f$ to each $\Sigma_i$; hence we have

$$\mathcal{H}_\Sigma = \bigotimes_{i=1}^{n} \mathcal{H}_{\Sigma_i}. \tag{14}$$

If $\mathbf{x} \in \Sigma_j$ then $\hat{\phi}(\mathbf{x})$ and $\hat{\pi}(\mathbf{x})$ act trivially on $\mathcal{H}_{\Sigma_i}$ for $i \neq j$. Hence, the $\mathcal{H}_{\Sigma_j}$ can be
thought of as representing those subsystems of the field which are localised within $\Sigma_j$.
This viewpoint can be reversed: we can take spatially localised subsystems of the field
as our starting point, in which case $\hat{\phi}(\mathbf{x})$ and $\hat{\pi}(\mathbf{x})$ are localised in $\Sigma_j$ because they act
trivially on the other $\mathcal{H}_{\Sigma_i}$.

The reader is likely to have noticed that none of the above would win prizes for math- 
ematical rigor! Indeed, on the face of it the approach of this whole section is appallingly 
badly defined. It will be the purpose of sections 3 and 4 to explain how we can attain
peaceful coexistence with this ill-definedness.



3 Infinities and renormalisation 

We now have at least a formal definition of a quantum field theory; but it is, clearly, 
profoundly unsatisfactory in many respects. In this section we consider the question of 
how, and to what extent, it is possible to restore some degree of mathematical rigour to 
the theory.

3.1 The problem of infinities



Where to start in analysing the mathematical shortcomings of section 2.3? We could
begin by noting the various ambiguities glossed over when considering functionals on an 
infinite-dimensional space: what topology should be placed on the space of functionals? 
How smooth must a field configuration be to be admitted?, etc. However, there is a more 
crucial problem which applies to any attempt to make wave-functional space a Hilbert 
space. This is the problem of how to define the inner product. In the finite case, of course, 
we define the inner product of two wave-functions by 



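$$(\psi_1, \psi_2) = \int_{\mathbb{R}^n} \mathrm{d}^n q \, \psi_1^*(q_1, \ldots, q_n)\,\psi_2(q_1, \ldots, q_n); \tag{15}$$

generalising this to wave-functionals gives a functional integral:

$$(\Psi_1, \Psi_2) = \int_{\mathcal{S}} \mathcal{D}\chi \, \Psi_1^*[\chi]\,\Psi_2[\chi], \tag{16}$$

where $\mathcal{S}$ is field configuration space and $\mathcal{D}\chi$ is a 'functional measure' on $\mathcal{S}$.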

Unfortunately, defining measures on infinite-dimensional spaces is extremely hard. 
Indeed, simple attempts to define $\mathcal{D}\chi$ tend to give infinity as the result of the functional
integral, which clearly isn't satisfactory (see, e.g., Binney et al. (1992, p. 409) for an
example of how this occurs). The integral can be made well-defined if we restrict ourselves
to a finite-dimensional subspace of $\mathcal{S}$, but as soon as we start to consider functions which
can vary on arbitrarily short length-scales, we lose the ability to define it. 
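
A rough illustration of the difficulty follows. It is a toy example assumed here, not the one given by Binney et al. Truncate configuration space to $N$ independent mode amplitudes $\chi_1, \ldots, \chi_N$, take the measure to be the product of ordinary Lebesgue measures, and consider a Gaussian wave-functional with hypothetical positive widths $\omega_k$:

```latex
\begin{align*}
\Psi[\chi] &= \exp\Big(-\tfrac{1}{2}\sum_{k=1}^{N}\omega_k\,\chi_k^2\Big), \\
(\Psi, \Psi) &= \int \prod_{k=1}^{N}\mathrm{d}\chi_k \,
   \exp\Big(-\sum_{k=1}^{N}\omega_k\,\chi_k^2\Big)
 = \prod_{k=1}^{N}\sqrt{\frac{\pi}{\omega_k}} .
\end{align*}
```

For each finite $N$ the result is finite, but as $N \to \infty$ the product generically has no finite non-zero limit: it diverges if the $\omega_k$ stay below some bound smaller than $\pi$, and it tends to zero if the $\omega_k$ grow without bound, as mode frequencies typically do at short length-scales.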

Blithely ignoring this little problem, we can press on with the development of our QFT, 
but we soon run into other infinities: in particular, when we try to calculate the effects 
of interactions, we find that our calculations include terms which involve integrating over 
arbitrarily short length-scales, and that some of these terms are infinite when the integral 
is taken over such length-scales. With care we can avoid the infinities in free-field theories 
(such theories can be exactly defined in AQFT) but no examples are known of realistic 
QFTs in which these infinities do not have to be confronted. 

One highly principled attitude to the problem might be to say: very well, none of our 
supposed 'interacting QFTs' count as real theories, so let us reject all of them and go 
looking for properly defined ones. Needless to say, though, this is not the mainstream 
approach in particle physics, where algorithms for extracting useful information from QFT 
despite the infinities have been known for nearly fifty years. To understand why these
algorithms work (and why they can be understood not just as 'algorithms', but as a valid
resolution of the problem of infinities) we take a brief digression into condensed-matter
physics.

3.2 High-energy cutoffs and renormalisation

Functional integrals are not restricted to relativistic quantum theory: they occur through- 
out physics, and in particular in condensed-matter physics. There too they formally lead 
to infinities; however, there is no question of a conceptual problem. For the integrals 
are taken on the assumption that (for instance) the density of matter is continuously 
varying, and can vary on arbitrarily short length-scales. But this continuum assumption
can only be an approximation, for matter is made of atoms, and any variations on
a length-scale short relative to the interatomic distance clearly lie beyond the scope of 
the approximation. So the functional integral must be cut off at some short, but finite, 
length-scale. 

It might be thought that this would lead to enormous computational difficulties, for 
the functional integral is dominated by the short-distance variations and hence the precise 
details of the cutoff procedure ought to be important. Remarkably, this is not so: it can 
be shown that if we restrict our attention to the behaviour of the system on lengthscales 
which are very long relative to the cutoff, then: 

1. All but a finite number of possible interaction terms (the so-called 'renormalisable' 
terms) have negligible effect on the system's behaviour. 

2. The only effect of the high-energy (i.e. short-distance) degrees of freedom on the
system's long-distance behaviour is to modify ('renormalise') the coefficients of the
renormalisable interaction terms. As such, the only effect of getting the cutoff
details wrong is to change the effective values of the coefficients in the Hamiltonian.
If, instead of trying to calculate these effective values based on the true values (which
in any case is often impossible) we simply measure them, the details of the cutoff
procedure are completely irrelevant to the long-distance behaviour of the system.
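
A standard textbook illustration of point 2 comes from relativistic scalar field theory in four spacetime dimensions rather than from condensed-matter physics; it is assumed here for concreteness and is not worked through in the text. Take an interaction term $\lambda\phi^4/4!$, write $\lambda(\Lambda)$ for the coupling defined at the cutoff scale $\Lambda$, and write $\lambda(\mu)$ for the effective coupling at a much lower energy scale $\mu$. At one-loop order:

```latex
\begin{align*}
\mu\,\frac{\mathrm{d}\lambda}{\mathrm{d}\mu} &= \frac{3\lambda^2}{16\pi^2} + O(\lambda^3), \\
\frac{1}{\lambda(\mu)} &= \frac{1}{\lambda(\Lambda)} + \frac{3}{16\pi^2}\,\ln\frac{\Lambda}{\mu} .
\end{align*}
```

To this order the cutoff enters low-energy predictions only through the combination on the right-hand side. Two different pairs $(\Lambda, \lambda(\Lambda))$ that yield the same $\lambda(\mu)$ are indistinguishable at scale $\mu$, up to corrections suppressed by powers of $\mu/\Lambda$, so measuring $\lambda(\mu)$ directly makes the cutoff details irrelevant.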



These results were established in the 1970s, primarily by Kenneth Wilson (see Wilson
and Kogut (1974) and references therein); Binney et al. (1992) give an exceptionally lucid
exposition.

At least mathematically, this process can be applied equally well to relativistic QFTs:
instead of allowing our functional integrals to range over all field configurations, we restrict
them to only those configurations whose short-distance variation is not too quick (the
conceptually simplest way, at least in Minkowski spacetime, is to exclude all configurations
whose Fourier coefficients fail to vanish above some fixed value of $|\mathbf{k}|$, where $1/|\mathbf{k}|$ is the intended
cutoff length). This yields a well-defined theory, and if we require the cutoff length to be far
smaller than the lengthscales at which we study the theory, then that cutoff length affects
the theory's predictions only through renormalisation of the coefficients in the effective
Hamiltonian.[5] Since the latter coefficients are in any case only known empirically, this


[5] See chapter 12 of Peskin and Schroeder (1995) for the technical details of this process.

has no practical consequences.
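
The sharp momentum cutoff described above can be sketched numerically. The following is a minimal illustration, assuming a real field sampled on a one-dimensional periodic lattice as a toy stand-in for a spatial slice. It projects a given configuration onto the subspace of configurations allowed by the cutoff. The function names `dft`, `inverse_dft` and `apply_cutoff` and the parameter `k_max` are hypothetical, chosen for this example only.

```python
import cmath
import math


def dft(values):
    """Return the discrete Fourier coefficients of a periodic sequence."""
    n = len(values)
    return [
        sum(values[j] * cmath.exp(-2j * math.pi * k * j / n) for j in range(n)) / n
        for k in range(n)
    ]


def inverse_dft(coeffs):
    """Rebuild the sequence from its discrete Fourier coefficients."""
    n = len(coeffs)
    return [
        sum(coeffs[k] * cmath.exp(2j * math.pi * k * j / n) for k in range(n))
        for j in range(n)
    ]


def apply_cutoff(field, k_max):
    """Zero every Fourier mode whose integer wavenumber exceeds k_max in magnitude."""
    n = len(field)
    kept = []
    for k, coeff in enumerate(dft(field)):
        # DFT index k stands for wavenumber k, or k - n in the upper half.
        wavenumber = k if k <= n // 2 else k - n
        kept.append(coeff if abs(wavenumber) <= k_max else 0.0)
    # The input is real and the cutoff treats +k and -k alike,
    # so the imaginary parts are only rounding noise.
    return [z.real for z in inverse_dft(kept)]


# Toy configuration: one long-wavelength mode plus one short-wavelength mode.
n_sites = 32
smooth = [math.cos(2 * math.pi * j / n_sites) for j in range(n_sites)]
rough = [s + 0.5 * math.cos(2 * math.pi * 10 * j / n_sites) for j, s in enumerate(smooth)]
filtered = apply_cutoff(rough, k_max=4)
```

Here `filtered` agrees with `smooth` up to rounding error: the mode with wavenumber 10 lies above the cutoff and is removed, while the wavenumber-1 mode passes through unchanged. A sharp cutoff of this kind is only one possible choice, which is the point of the surrounding argument: long-distance predictions should not depend on such details.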

What physical justification might there be for imposing a cutoff in relativistic QFT? 
Three possibilities are generally mentioned in the literature (see, e.g., Cao (1997) or
Binney et al. (1992) for details):



• It is easy to construct field theories in which some of the degrees of freedom are
'frozen out', i. e. become irrelevant, below some given energy scale. Below this scale, 
the field theory can be described by an 'effective field theory' (EFT), which does 
not explicitly include the frozen-out degrees of freedom; however, those degrees 
of freedom do have the effect of imposing an effective cutoff of the EFT. This is 
believed to happen for a number of actually studied low-energy theories, whose 
cutoff energies are experimentally accessible; see Binney et al. (1992, pp. 372-373)
for a brief discussion. Of course, when a cutoff is generated this way, it relies on the
existence of the 'true' field theory, which in turn can only be rigorously defined with 
a cutoff of its own. This implies either some infinite tower of EFTs, which describe 
physics at successively higher energy scales, or else some other cutoff mechanism 
which truncates the tower. 

• Possibly at sufficiently high energy scales (say, beyond the 'Grand Unification energy'
(Peskin and Schroeder 1995), about $10^{16}$ GeV, above which the strong and
electroweak forces are conjectured to become unified) the entire field-theoretic description
of physics may break down and be replaced by some other theory (such as
string theory). This new theory would then impose an effective cutoff, and might 
itself be free of infinities. 

• At still higher energy levels (and hence still shorter lengthscales), it is widely believed 
that the concept of spacetime as a continuum will itself break down, to be replaced 
by some quantized version. On dimensional grounds this is expected to happen 
at around the Planck energy, or $10^{19}$ GeV; this is equivalent to a breakdown on
lengthscales of order $10^{-35}$ metres (the Planck length).

In practice, the second and third alternatives may not be all that distinct: although string 
theory is usually formulated as a perturbative theory on a flat background spacetime, it is 
generally accepted that this background spacetime must eventually be eliminated from the 
theory — in which case, presumably, we would again expect spacetime to be an effective 
concept emerging only on lengthscales greater than the Planck length. 

3.3 The conceptual status of a cutoff QFT 

The introduction of a finite cutoff, then, mathematically resolves the infinities problem, 
and might turn out to be justifiable on a number of physical grounds. However, it is prima
facie unfortunate from a foundational perspective, for the following reason: in foundational
work, it is usual to start with a well-defined (and, preferably, axiomatisable) theory,
and then to investigate the implications of that theory. If QFTs are intrinsically approxi- 
mate theories, which can be trusted only at certain energy scales and whose behaviour is 
unknown at other scales, it is hard to take this attitude to them. Should we then regard 
a QFT as a well-defined theory at all, or (as Buchholz (2000) has advocated) as

an efficient algorithm for the theoretical treatment of certain specific problems
in high-energy physics (Buchholz 2000, pp. 1-2)?



Cao (1997, pp. 350-352) identifies three different, commonly adopted, attitudes to the 
foundations of QFT: 

(1) The current situation is genuinely unsatisfactory: we should reject the cutoff theories 
as not mathematically well-defined, and continue to look for nontrivial theories 
defined at all lengthscales. 

(2) The picture of 'an infinite tower of effective field theories' mentioned in the last 
section is to be taken seriously. As Cao stresses, this would require 

a drastic change of our conception of fundamental physics itself, a change
from aiming at a fundamental theory (as the foundation of physics) to
having effective theories valid at various energy scales. (Cao 1997, p. 351)

(3) QFTs as a whole are to be regarded only as approximate descriptions of some as- 
yet-unknown deeper theory, which gives a mathematically self-contained description 
of the short-distance physics. 

(1) is of course the viewpoint of the entire algebraic QFT program: that program's ul- 
timate long-term goal is to produce a mathematically rigorous description of interacting 
QFTs (specifically, of the Standard Model or of some successor to it). Obviously, com- 
pletion of that goal would dramatically change the foundational status of quantum field 
theory; it is only because the goal has not been achieved that it is interesting to investi- 
gate alternatives to (1). Conversely, anyone regarding either (2) or (3) as fully satisfactory 
from a foundational viewpoint would have reason to doubt whether the eventual success 
of the AQFT program is possible. 

(2) will not be discussed further here: adopting it would require dramatic changes
to many aspects of our attitude to the foundations of physics, and a discussion of such
changes lies far beyond the scope of this paper. See Castellani (2000) and Hartmann (2000)
for further discussion.

The rest of this discussion will focus on (3), and its attendant foundational problem: 
how can we give a clean conceptual description of a theory which can be rigorously defined 
only as the low-energy limit of another theory which we do not yet have? 

To see the problem here, contrast this case with a more straightforward one: the
current status of classical mechanics (CM), given that we believe it to be in some sense
superseded by non-relativistic quantum mechanics (NRQM). What is our justification
for continuing (as in many situations we do continue) to use CM, given that we
believe NRQM to be the more valid description? It is not enough to observe that NRQM
tends to CM as $\hbar \to 0$, for $\hbar$ does not tend to zero: it is a constant. Furthermore,
the limit obtained by considering successively larger systems (so that the quantity
$\hbar$/(typical action) tends to zero) is singular and contains various surprises (this matter
is discussed in Zurek 1998b). A better approach is to identify what might be called
'classical domains' of NRQM: that is, domains of NRQM in which NRQM is approximately
isomorphic to CM. (A system of large bodies decohered by interactions with an environment
would be an example of such a domain.) Within such a domain, we could then tentatively
apply CM, whilst remaining aware that CM could fail to give accurate answers to some
questions, and that its predictions should only be treated as accurate to within the accuracy
of the 'approximate isomorphism' between CM and the restricted domain of NRQM. (This
approach to inter-theoretic reduction is discussed in much more detail, and in the specific
context of the relationship between CM and NRQM, in Wallace (2001c); it is closely related to
Cao's structural approach to QFT (Cao 1997, chapter 12) and to the 'structural realist'
program inaugurated by Worrall (1989) and discussed in, e.g., Psillos (1995) and Ladyman
(1998).)



Approach (3) presumes the existence of some theory — call it X — to which QFT is 
to be an approximation (in some domains). There are, however, a number of important 
differences between the X-QFT relationship and the NRQM-CM one: 

1. Most obviously, we know NRQM, whereas we have only the sketchiest idea of what 
X will turn out to be. This makes it difficult to delineate the domains in which QFT 
is approximately isomorphic to X, since we have to describe those domains in QFT- 
language rather than X-language. In particular, we would like to say that these 
domains are those in which the states do not vary on too-short lengthscales, but 
this implicitly suggests that it even makes sense to talk about states which do vary 
on such lengthscales - which may not be possible in X: indeed, X may not contain 
any elements which are even approximately isomorphic to too-rapidly-varying states 
of QFT. In solid-state physics, by analogy, it isn't even meaningful to talk about 
density fluctuations on lengthscales shorter than the interatomic separation. 

2. When using CM, in situations where we are unsure about its validity we can always 
cross-check with NRQM to check that its predictions agree with CM. Since we don't 
have X, we can't do this in QFT: the only tests available for its validity in describing 
a given phenomenon are empirical. 

3. CM is a mathematically well-defined and self-consistent theory, whereas QFT as we
have described it so far is not. 



It is useful to separate the first and second points from the third. For if we had a 
mathematically well-defined QFT, we would have a perfectly good physical theory which, 
however, we would have good reason to believe empirically inadequate for describing 
certain phenomena, even though we have no better alternative. This has been a common 
situation throughout physics: classical mechanics in the early 20th century is an example 
of such a theory; general relativity today is another. In fact, until and unless we come up 
with a theory which we are supremely confident is 'final', this should be our attitude to 
any well-tested physical theory: its predictive utility and explanatory power require us to 
take seriously those entities which the theory claims to exist (such as classical particles,
or general-relativistic spacetimes), but to remember that 

• these entities may be patterns or structures in some more fundamental ontology, 
rather than fundamental entities in their own right; 

• our theory may describe the world not by virtue of being 'fundamental', but by 
virtue of being instantiated in the structure of a deeper theory; 

• this instantiation may be only approximate, leading to domains in which the theory 
fails. 



(See Wallace (2001b) for a discussion of the ontology of patterns and structures in physics.)



So, if we had a mathematically well-defined QFT then we could happily interpret it
in the same way we now interpret CM. Can we obtain such a QFT?

The easiest way to do so is to choose a concrete implementation of the short-distance 
cutoff. A simple recipe for such an implementation is as follows: 

1. Choose a cutoff lengthscale l.

2. Choose a spacetime foliation. 

3. On each leaf of the foliation, replace the continuum of field operators $\phi(x)$, $\pi(x)$
with a discrete grid of such operators (to be called grid observables) with the grid
spacing being ~ l.

4. Discretise the Hamiltonian, replacing its integral over functions of field operators 
with a discrete sum over operators on the grid. 

5. Replace the (formal) Hilbert space $\mathcal{H}$ of the QFT (defined in section 2.3 as the
space of complex functionals over all field configurations on S) with the space of
functionals over some subset of field configurations, constrained to vary in some suitably
chosen way between points of the grid. (The details of how this is to be done don't
matter, i.e. have no consequences for the large-scale behaviour of the model,
provided that specifying the value of a field configuration at all grid points is enough
to identify it uniquely.)
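
The recipe above can be illustrated with a short worked sketch. It is an illustration only: it assumes a single real scalar field with an unspecified potential $V$, a cubic grid in three spatial dimensions, units with $\hbar = 1$, and a forward-difference form for the gradient. The site labels $\mathbf{x}$, the unit vectors $\mathbf{e}_i$ and the symbol $H_l$ are notation introduced here, not taken from the text.

```latex
% Formal continuum Hamiltonian on one leaf of the foliation:
H = \int d^3x \left[ \tfrac{1}{2}\,\pi(x)^2
      + \tfrac{1}{2}\bigl(\nabla\phi(x)\bigr)^2
      + V\bigl(\phi(x)\bigr) \right]

% Discretised Hamiltonian on a cubic grid of spacing l (steps 3 and 4):
H_l = \sum_{\mathbf{x}} l^3 \left[ \tfrac{1}{2}\,\pi_{\mathbf{x}}^2
      + \tfrac{1}{2}\sum_{i=1}^{3}
        \left(\frac{\phi_{\mathbf{x}+l\mathbf{e}_i}-\phi_{\mathbf{x}}}{l}\right)^{2}
      + V(\phi_{\mathbf{x}}) \right]

% Canonical commutation relations for the grid observables:
[\phi_{\mathbf{x}}, \pi_{\mathbf{y}}] = \frac{i}{l^3}\,\delta_{\mathbf{x}\mathbf{y}},
\qquad
[\phi_{\mathbf{x}}, \phi_{\mathbf{y}}] = [\pi_{\mathbf{x}}, \pi_{\mathbf{y}}] = 0
```

Any other reasonable difference scheme or grid geometry would serve equally well as an illustration; the point is only that the result is an ordinary quantum system with one pair of canonical variables per grid point.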

Such a theory is a close relative of the QM description of a crystal: in the latter, we 
specify the position and momentum of each crystal atom; in the former, the field strength 
and conjugate momentum at each gridpoint. (It is also very similar to the "lattice gauge
theories" studied in lattice QCD.) It is also perfectly well-defined mathematically, and as
such is a valid entity to be viewed as approximately isomorphic to some subtheory of X.

We are interested in the structure of states of such theories, when studied at lengthscales
$\gg l$. To give a precise meaning to this, recall that the physical meaning of a state
is given by its expectation value on elements of the algebra of observables. We can define
the large-scale observables of a discretised QFT as those which are averages of grid
observables over regions which are very large compared to the grid spacing; then we can
regard the large-scale structure of a state as specified by its expectation value with respect
to all large-scale observables.[6] The small-scale structure of a state is not to be treated as
physically significant, since it is defined at lengthscales at which we expect the approximate
isomorphism between X and our discretised QFT to break down. (Analogously, the
structure of a classical phase-space distribution can be fairly reliably viewed as telling
us about the actual world if it is studied on action scales large compared to $\hbar$, but is
probably meaningless on scales small compared to $\hbar$.)
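
As a rough illustration of this definition, a large-scale observable can be written as a weighted sum of grid observables. The sketch assumes a cubic grid of spacing $l$ in three dimensions; the smearing function $f$, the symbol $\phi_f$ and the particular slow-variation condition are notation and assumptions introduced here.

```latex
% A large-scale observable: the grid field averaged against a weight f
% that varies slowly on the scale of the grid spacing l
\phi_f = \sum_{\mathbf{x}} l^3\, f(\mathbf{x})\, \phi_{\mathbf{x}},
\qquad l\,|\nabla f| \ll |f|

% The analogous smeared momentum:
\pi_g = \sum_{\mathbf{x}} l^3\, g(\mathbf{x})\, \pi_{\mathbf{x}},
\qquad l\,|\nabla g| \ll |g|

% Large-scale structure of a state \rho: its expectation values on
% products of such smeared observables
\langle \phi_{f_1} \cdots \phi_{f_m}\, \pi_{g_1} \cdots \pi_{g_n} \rangle_\rho
```

Two states that agree on all such expectation values count, on this view, as having the same large-scale structure, whatever they do between grid points.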

Now, obviously there is a very large number of possible discretisations of a field theory: 
one for each possible choice of grid, at least (and no claim is being made here that the 
method above is the only way of implementing the cutoff). But renormalisation tames 
this profusion of theories to some extent, for it implies that the large-scale structure of 
two different discretisations can be made to be virtually identical (and totally identical 
in the limit as the two theories are studied on larger and larger length-scales) by finite 
rescaling of the field observables and finite adjustments in finitely many parameters of 
the Hamiltonians of the theories. 

By analogy with section 2.2, we can construct an algebra net by associating to each
region O which is large compared with the cutoff length l, an algebra A(O) of large-scale
observables localised in O. The statement that two discretised QFTs are structurally
equivalent at large scales is then equivalent to the statement that both generate the same
large-scale nets: a large-scale variant of the Net Assumption of section 2.2.



6 By 'grid observables' I mean the basic field observables $\phi(x)$ and $\pi(x)$, not their pointwise products;
so for instance $\int dx\,dy\, f(x) g(y) \phi(x) \phi(y)$ is a large-scale observable (if f and g vary slowly compared
to l) but $\int dx\, f(x) \phi(x)^2$ is not, irrespective of how smoothly f varies. This means, in particular, that
the energy density is not a large-scale observable. In fact, the energy density of a discretised QFT is (a)
cutoff-dependent, and (b) extremely large: this is the origin of the 'cosmological constant problem'.



In fact, we can mimic the algebraic description of section 2.2 further, by defining our
cut-off QFTs directly in terms of their algebraic structures: we define a scale-l algebraic
QFT as a map from regions of M which are large compared with l, to operator algebras,
such that the axioms of AQFT are approximately satisfied on scales large compared with
l. (Obviously this is not intended to be precise.) Two 'concrete' QFTs, such as two
versions of the discretised QFT above, are then scale-l equivalent if they generate the
same scale-l AQFT; hence scale-l equivalence is a generalisation of Borchers equivalence.

The results of renormalisation theory can now be simply stated: two discretisations
of the same renormalisable QFT can be made scale-l equivalent just by finite
renormalisations of the coefficients in one of them. So, for $\phi^4$ QFT (for instance) there is
a two-parameter family of scale-l AQFTs, parametrised by the renormalised mass and
coupling constant.
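
For concreteness, the two parameters can be read off from the standard form of the theory. The sketch below is schematic: the metric signature, the $4!$ normalisation, and the labels $D$, $D'$, $m_0$, $\lambda_0$, $m_R$, $\lambda_R$ and $\sim_l$ are conventions chosen here for illustration.

```latex
% Formal continuum Lagrangian density, signature (+,-,-,-):
\mathcal{L} = \tfrac{1}{2}\,\partial_\mu\phi\,\partial^\mu\phi
            - \tfrac{1}{2}\,m^2\phi^2
            - \frac{\lambda}{4!}\,\phi^4

% Two discretisations D and D', with bare parameters (m_0, \lambda_0) and
% (m_0', \lambda_0'), belong to the same scale-l equivalence class when the
% bare parameters are tuned so that the renormalised ones agree:
D \sim_l D'
\quad\text{when}\quad
m_R(D) = m_R(D'), \qquad \lambda_R(D) = \lambda_R(D')
```

The bare parameters will in general differ between the two discretisations; only the renormalised pair $(m_R, \lambda_R)$ labels the equivalence class.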

Now if we know that a given scale-l AQFT is empirically accurate, we can find any
number of concrete QFTs which are members of its scale-l equivalence class. We can
choose any such member to calculate with, and make our choice based upon calculational 
convenience, but we need not regard any of them as candidates for the fundamental 
theory of nature. Rather, they are to be viewed as candidate theories to be approximately 
isomorphic to a subtheory of X. 

From this perspective, the profusion of discretisations is no embarrassment. For suppose
QFT$_1$ and QFT$_2$ are members of the same scale-l equivalence class, and suppose
the scale-l AQFT corresponding to that equivalence class accurately describes our
observations (on scales large compared with l). Then it follows that QFT$_1$ and QFT$_2$ are
both approximately isomorphic to a subtheory of X,[7] with that approximate isomorphism
breaking down around the cutoff lengthscale. But if so, then there can be no fact of the
matter as to which of QFT$_1$ and QFT$_2$ is 'correct': both are approximately instantiated
by X. If X is itself a QFT (presumably some interacting QFT which exactly satisfies the
axioms of AQFT) then X itself will be inside the scale-l equivalence class, but if X is
something else (a string theory, or some theory in which spacetime is quantized) then
no elements of the equivalence class are fundamental: all are useful purely because of
their structural resemblance to a subtheory of X.

7 Note that QFT$_1$ and QFT$_2$ must be approximately isomorphic to the same subtheory of X, since
they are intended to describe the same physical domains as one another.

3.4 Covariance

One of the least aesthetic features of discretised QFTs is their non-covariance, which
manifests itself in (at least) two ways. Firstly, a lattice is inevitably not preserved under
Poincaré transformations (indeed, it is not even preserved under translations). Secondly,
the commutators of field operators which are spacelike separated are in general not zero
(although they drop off extremely rapidly if the field operators are separated by distances
large compared with the cutoff distance). This second effect is because of the discretisation
of the Hamiltonian, in which spatial derivatives like $\nabla\phi(x)$ are replaced with terms like
$(\phi(x + \mathbf{l}) - \phi(x))/l$, where $l$ is the cutoff length and $\mathbf{l}$ is a vector of length $l$. As a
consequence, influences can propagate instantaneously from $x$ to $x + \mathbf{l}$, although the
effect is minuscule for widely separated points.
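
A short calculation shows how the non-vanishing commutator arises. This is a sketch under stated assumptions, not a derivation from the text: one spatial dimension, grid spacing $l$, units with $\hbar = c = 1$, a local potential $V$, and the Heisenberg equation taken in the form $\dot A = i[H, A]$ (the leading term found below is the same under the opposite sign convention).

```latex
% Discretised Hamiltonian and commutation relations in one dimension:
H_l = \sum_x l \left[ \tfrac{1}{2}\,\pi_x^2
      + \tfrac{1}{2}\left(\frac{\phi_{x+l}-\phi_x}{l}\right)^{2}
      + V(\phi_x) \right],
\qquad [\phi_x, \pi_y] = \frac{i}{l}\,\delta_{xy}

% Heisenberg equations of motion:
\dot\phi_x = \pi_x,
\qquad
\dot\pi_x = \frac{\phi_{x+l} - 2\phi_x + \phi_{x-l}}{l^2} - V'(\phi_x)

% Short-time expansion of the evolved field operator:
\phi_x(t) = \phi_x + t\,\pi_x
  + \frac{t^2}{2}\left[\frac{\phi_{x+l} - 2\phi_x + \phi_{x-l}}{l^2}
  - V'(\phi_x)\right] + O(t^3)

% Commutator with the momentum at the neighbouring site at time zero:
[\phi_x(t), \pi_{x+l}(0)] = \frac{i\,t^2}{2\,l^3} + O(t^4)
```

The commutator is non-zero for arbitrarily small $t$, including $t < l$, where the two operators are spacelike separated. For sites $n$ grid steps apart the field at the distant site first enters the expansion at order $t^{2n}$, which is one way to see why the effect falls off so quickly with separation.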

This is not to say that the approach advocated in section 3.3 implies that the actual
world has faster-than-light signalling or a preferred reference frame. Recall that in this
approach, our discretised QFT is taken to describe the actual world only insofar as it is
isomorphic to a subtheory of the deeper "theory X", and we expect that this isomorphism
will hold only for expectation values of field observables averaged over regions large
compared with the cutoff length. But for these expectation values, we find that violations
of covariance are very small, tending to zero as the regions over which the observables
are averaged increase in size. In other words, non-covariance is essentially a small-scale
property of discretised QFTs, and such properties are precisely those for which, it was
argued, we should not regard QFT as telling us about the actual world.

Notwithstanding this, the situation is still rather unsatisfactory, at least if we view 
covariance as a fundamental property of physics rather than some effective limit of a 
non-covariant theory. There are a number of ways of coming to terms with this: 

1. We might manage to find a covariant way of defining discretised QFTs. Obviously 
this would remove the covariance problem; however, it seems rather unlikely since 
cutoffs are about short lengthscales, and length is not a relativistic invariant. (Of 
course, if we look for a covariant QFT defined without a cutoff, we are moving away 
from the approach described here, and back towards the AQFT program.) 

2. We might bite the bullet and accept that covariance is only an approximation. This
would follow, for instance, if X were itself a non-covariant theory (such as Barbour
(1994a, 1994b, 1999) has advocated in quantum gravity).



3. We might find that X is itself a totally covariant theory. There seems no reason why 
such a theory should not have a family of useful non-covariant approximations.

4. Perhaps most interestingly, X may be a theory which does not involve spacetime
at all at a fundamental level (as is conjectured to be the case in loop-space
quantum gravity, or non-perturbative string theory). In this case, relativistically
covariant spacetimes would have to emerge as effective, approximate concepts from
such a theory, and it would seem to be a curiosity rather than a pathology if this
emergence proceeded via non-covariant intermediate theories. After all, when we
say that we expect spacetime to be quantized, we generally do not mean that it
is literally to be divided into grid squares, but rather that it is to be replaced
in toto with some fundamentally discrete entity which in some circumstances is
approximately isomorphic to spacetime; yet in recovering that spacetime it might
be necessary to use an intermediate step which does involve treating spacetime as
a grid. (To take an analogy, we think that the phase space of classical mechanics
breaks down at scales of ~ $\hbar$, but that doesn't mean that we think that phase space
really is a grid of $\hbar$-sized squares.)



4 Inequivalent representations 



In section 2.2, we defined algebraic QFTs in terms of a net of operator algebras. The
usual definition, however, is of a net of abstract algebras, which can be represented as 
Hilbert space operators in a number of inequivalent ways. At first sight these inequivalent 
representations seem to cause problems for QFT since there seems no principled way 
to select the 'right' one; it is also not immediately obvious what has happened to the 
inequivalent representations in our previous quantization (in section |2.3|) of QFT in terms 
of wave-functionals on configuration space. In this section we shall address these issues, 
and will show that inequivalent representations — although important and interesting in 
QFT — pose no problems for its foundations. 



4.1 The algebraic approach 

Consider the quantum mechanics of some finite number of nonrelativistic, scalar particles.
We could specify this theory by starting with the Hilbert space, and then describing the
actions of the various observables (position, momentum and functions thereof) on the
space. We would need to specify a specific function of the observables as the Hamiltonian,
so as to tell us how states evolve in time.

Once we have these observables, we could shift to the Heisenberg representation by
using the Hamiltonian to time-evolve the observables:

X(t) = exp(-itH)X exp(+itH). 

(The Hamiltonian would then be unnecessary, since all the information about the system's
time evolution would be contained within the time-evolved observables.) Let A be the
algebra of these observables.[8]

Now suppose we take A considered as an abstract algebra (with the norm topology,
i.e. a C*-algebra) and discard the Hilbert space. We would still have a great deal of
dynamical information encoded in the algebraic and topological structure of the algebra,
but would have lost the action of the algebra on physical states.

8 If we were being mathematically rigorous, we would instead work with the algebra of bounded
functions of the observables, since X and P are unbounded and thus very awkward to work with.

Suppose we try to recover the Hilbert space action. That is, we wish to find an
isomorphism between the abstract algebra A and some subalgebra of the bounded operators
on a Hilbert space $\mathcal{H}$, i.e. a representation[9] of A. If this representation turns out to be
unique up to unitary equivalence (an assumption which was often made uncritically in the
early days of quantum theory) then we must have recovered the original Hilbert space,
i.e. the representation must be isomorphic to the original choice of Hilbert space and of
observables on it.

In fact, for a system of finitely many particles the representation is indeed unique,
provided we require it to be irreducible. This last is the requirement that the Hilbert
space does not split into orthogonal subspaces preserved by the action of A. Equivalently,
every nonzero vector in the Hilbert space must be cyclic: its image under the action of
A must be dense in the Hilbert space. Physically this requirement rules out the existence of
superselection rules: superselection sectors can be identified with the irreducible sectors
of a reducible representation.

By an algebraic quantum theory, we shall mean one which is specified by giving the
algebraic structure of the observables but not the Hilbert space on which they act. A
theorem of Stone and von Neumann tells us that any such quantum theory with a finite
number of degrees of freedom (i.e. finitely many 'position' and 'momentum' observables,
corresponding to a classical theory with a finite-dimensional phase space) has a
unique irreducible representation (up to unitary equivalence). Hence for such systems the
algebraic and Hilbert-space ways of specification are equivalent.
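
For a single degree of freedom the content of the theorem can be sketched as follows. The sketch uses units with $\hbar = 1$ and the standard Weyl (exponentiated) form of the commutation relations; the symbols $U$, $V$, $a$ and $b$ are notation introduced here, and the regularity conditions of the theorem (such as continuity of the representation) are only gestured at.

```latex
% Canonical commutation relation:
[X, P] = i

% Bounded (unitary) Weyl operators built from X and P:
U(a) = e^{iaP}, \qquad V(b) = e^{ibX}, \qquad a, b \in \mathbb{R}

% Since e^{iaP}\, X\, e^{-iaP} = X + a, the relation takes the Weyl form
U(a)\,V(b) = e^{iab}\,V(b)\,U(a)

% Stone-von Neumann: every irreducible, suitably continuous representation
% of this relation is unitarily equivalent to the Schroedinger one on
% L^2(\mathbb{R}):
\bigl(U(a)\psi\bigr)(x) = \psi(x + a),
\qquad
\bigl(V(b)\psi\bigr)(x) = e^{ibx}\,\psi(x)
```

Working with $U$ and $V$ rather than $X$ and $P$ sidesteps the awkwardness of unbounded operators while carrying the same algebraic information.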

4.2 Infinite-dimensional systems 

When the number of degrees of freedom of a theory becomes infinite, the Stone-von 
Neumann theorem fails. Such theories generally have a large number of inequivalent 
representations, and so giving the algebraic structure does not completely specify the 
theory. 

Nonetheless it still seems natural to specify a theory algebraically, because 

1. The way in which we usually describe a system is to describe its observables.
Furthermore, a theorem of Fell (see below) tells us that no amount of empirical data
can tell us what the 'real' representation is.

2. As explained in section 2.2, the 'quantization' process begins with the classical
observables and tries to map them onto quantum-mechanical operators. Since in this
process we are trying to impose purely algebraic restrictions on this map (specifically,
the restriction that Poisson brackets go over to commutators) it will generally specify
only the algebraic structure of a quantum theory, with an ambiguity left as to the
Hilbert space action. In other words, two quantum theories which are algebraically
identical could be regarded as equally valid quantizations of the classical system.[10]

9 Technically, a representation need not be an isomorphism. However, here we are working with simple
C*-algebras, for which all irreducible representations are faithful (i.e. isomorphic to the original algebra).

If we were to take a robustly instrumentalist viewpoint there would be no particular 
problem: Fell's theorem says that a finite number of measurements, each conducted with 
finite accuracy, cannot distinguish between representations. Hence we could reproduce 
any experimental results using whichever representation was most convenient.[11]

From a realist standpoint, however, we appear to have a dilemma. Even if no empirical 
data lets us distinguish between representations, nonetheless there should be a fact of the 
matter as to which is the 'correct' representation; yet it is provably impossible for us ever 
to discover this fact. To see this, suppose we have two representations on Hilbert spaces
$\mathcal{H}_1$ and $\mathcal{H}_2$, and suppose the 'real' representation is the first. Then if the system is in
some state $|A\rangle$, Fell's theorem tells us that

For any operation O carried out with finite accuracy on $|A\rangle$, there is a state
$|O; A\rangle$ in $\mathcal{H}_2$ such that all data resulting from the operation are consistent
with the real representation being on $\mathcal{H}_2$ and the real state being $|O; A\rangle$.

In fact, this dilemma will prove only apparent, as we will find in the next three sections
when we study representation ambiguities in more detail. To do so, we distinguish two
ways in which inequivalent representations can occur: one associated with the
short-distance and high-energy ('ultra-violet') degrees of freedom, one with the long-distance
('infra-red') ones. Given our tolerant attitude to short-distance cutoffs (cf. section 3), it
should come as no surprise that all the important sources of inequivalent representations of
a given QFT fall into the second category: for the Stone-von Neumann theorem guarantees 
uniqueness of representation for any theory with finitely many degrees of freedom, and 
a field theory in any finite region has only finitely many degrees of freedom below a 
finite energy bound. We will argue that the ultra-violet representation ambiguity is a 
pure mathematical artefact, whilst the infra-red ambiguity is physically real but non- 
pathological. 

4.3 Ultra-violet degrees of freedom 

It is probably already clear to the reader why we can ignore the ultra-violet representation
ambiguity: UV-inequivalent representations occur because of the existence of degrees of
freedom at arbitrarily short lengthscales, and in section 3 it was argued that real QFTs are
cut off at short, though finite, lengthscales. In particular, the discretised QFTs of section
3.3 have only finitely many degrees of freedom per space-time point, and hence no
UV-inequivalent representations. Since renormalisation theory tells us that any QFT is scale-l
equivalent to some such discretised QFT (on lengthscales long compared to the grid size
for the latter theory), it follows that any occurrence of UV-inequivalent representations is
purely a mathematical artefact and has no physical significance.

10 Separately from this, of course, we might not in general expect quantization even to produce a unique
algebraic structure for the quantum observables, because of ambiguities about operator ordering.

11 It is important in this approach that we have all the observables in the algebra. If, for instance, the
(renormalized) stress-energy tensor is not in the algebra, then its values may distinguish representations
(Wald 1994).



4.4 Inequivalent representations in non-relativistic physics 

The physical significance of IR-inequivalent representations has long been appreciated in
AQFT; it will be reviewed here from the perspective which this paper adopts towards
QFT, but with no claim to originality. We begin by returning to non-relativistic QM:
imagine a line of n two-state systems (spin-half particles fixed in place, say). The
Hilbert space of each system is two-dimensional, and we can construct the overall Hilbert
space by taking the n-fold tensor product. The resulting space will have dimension $2^n$,
which is as we expect since we must make n yes-no choices to choose a state with (say)
given spin in the z-direction for each component system.

Now suppose our system becomes infinitely large. The dimension of the system will
be infinite, of course, but it will be a larger infinity than those to which we are used:
specifically it will be $2^{\aleph_0}$, the cardinality of the continuum, which is strictly larger than
the cardinality $\aleph_0$ of the integers.[12]

It follows that systems with infinitely many components have a Hilbert space which
is non-separable (i.e., has uncountable dimension). To see the consequences of this,
consider the operator algebra of our set of two-state systems. It consists of the set of
linear combinations of spin operators, and hence has countably many linearly independent
elements. The action of this algebra on any given state will generate only countably many
linearly independent states, hence the action of the operator algebra on the total,
non-separable space is highly reducible.

In other words, the non-separable space has decomposed into various superselection
sectors (uncountably many, in fact), each giving an irreducible representation of the
operator algebra.

12 See, e.g., Enderton (1977).

To see what these sectors are, suppose we start with all components having spin
up. Then the action of any element of the algebra can, at most, cause finitely many
components to have spin down. So no amount of algebraic action can transform such a
state into one in which, say, every second component has spin up. This state, in turn,
can be transformed into other states differing from it in finitely many places, but not into
a state in which all components are spin down ... or every third component is spin down
... or where half the components are spin up but the spin-up components are grouped in pairs ...
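
The counting behind 'uncountably many sectors' can be sketched as follows, restricting attention to states in which every component has a definite spin in the z-direction. The notation ($s$, $s_k$, $\sim$) is introduced here for illustration, and the sketch counts only the sectors generated from such configurations.

```latex
% Configurations of the infinite chain:
s = (s_1, s_2, s_3, \dots), \qquad s_k \in \{\uparrow, \downarrow\}

% Two configurations lie in the same sector when they differ at only
% finitely many sites:
s \sim s' \iff \#\{\, k : s_k \neq s'_k \,\} < \infty

% Each equivalence class is countable (it is labelled by the finite sets
% of flipped sites), whereas there are 2^{\aleph_0} configurations in all:
\#[s] = \aleph_0, \qquad \#\{\text{configurations}\} = 2^{\aleph_0}

% Hence the number of equivalence classes, and so of sectors of this kind, is
\#\{\text{sectors}\} = 2^{\aleph_0}
```

The last step uses the fact that a union of fewer than $2^{\aleph_0}$ countable sets cannot have cardinality $2^{\aleph_0}$.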

In other words, the different representations describe states which are 'infinitely' different
from one another (see Morrison (1962) for a full discussion). In statistical mechanics,
different representations describe different phases, since at arbitrarily large length-scales
(tending to the thermodynamic limit) only the infinite differences between representations
remain visible.

For such systems, of course, the use of different representations is an idealization, since 
the actual systems are finite. 



4.5 IR-inequivalence in quantum field theory 

Guided by the nonrelativistic example, we look for inequivalent representations in field 
theory by looking for states differing on asymptotically large scales. The difference from 
the previous section is that here infinite systems are perfectly respectable: in fact, our 
discretised QFTs are effectively infinite crystals. 

This resolves the problem mentioned at the start of section 4: what is the connection
between inequivalent representations and the wave-functional approach of section
2.3? The answer is that the space over which the wave-functionals are defined (the
field-configuration-space S) is infinite-dimensional both because functions may vary on
arbitrarily short length-scales, and because they may have arbitrary large-distance boundary
conditions. The former reason is nullified when a short-distance cutoff is imposed (as
in section 3.3), but the latter one also interferes with the definition of the functional integral,
and has to be dealt with by imposing boundary conditions at infinity (such as the
requirement that functions in S be square-integrable). Each choice of boundary condition
generates a different, and inequivalent, representation.

Examples of inequivalent representations are then: 



Differing mass-densities at infinity 

The closest analogue to the previous section is the case of states with asymptotically
nonvanishing mass density: average density is superselected (Haag 1996) and different
densities correspond to different representations. It is worth noting that this rules out
the Fock representation for an open universe, even a flat one, since any realistic state
of such a universe will have nonzero mass density; the Fock representation, which has
finite-energy states, stands in the same relation to a realistic open-universe representation
as an asymptotically empty classical spacetime does to a realistic open cosmology (Wald
1981).



The overall velocity of the universe is also superselected, of course, except in the special 
case where the (expected value of the) stress-energy tensor is asymptotically diagonal. 



Differing total charge 

Even when the spacetime is asymptotically empty, inequivalent representations exist. The
total charge of a system leaves its imprint on the spacetime at arbitrarily long distances
in the form of the asymptotic Coulomb field, whose flux through a sphere is independent
of the size of that sphere. As such, total charge is superselected (Strocchi and Wightman
1976). (Another way to see this is to remember that field lines extend to infinity, so no
amount of local operations will remove them.)

In fact, the centre-of-mass velocity of a charged system is also superselected (Haag
1996), since the asymptotic field will be a Lorentz-boosted Coulomb field and the associated
magnetic field lines also extend to infinity.

Inequivalent vacua

A major component of the Standard Model of particle physics (Peskin and Schroeder
1995) is the presence of field theories with degenerate vacua. Classically this corresponds
to the existence of nonzero field configurations which are global minima of the energy. If
the field theory has a continuous global symmetry then there will exist a continuum of
vacuum states, wave-packets around which will give us the quantum ground states. Since
these states differ from one another everywhere in space, they too belong to inequivalent
representations.

Changes in the interaction parameters 



A famous theorem (Haag's theorem; see Haag 1996) tells us, in effect, that interacting
fields cannot be represented on the same Hilbert space as the corresponding free fields,
even when we consider asymptotically early or late field operators. In the current framework
this may be understood as follows: introducing the interaction will change the
ground state everywhere in space, causing it to be infinitely different from the free-field
ground state and so in a different superselection sector.

In fact, and as these examples show, there is not really anything 'quantum' about
IR-inequivalent representations: in many cases, picking a representation is rather like
picking boundary conditions at infinity in a classical problem. The following classical
(and utterly banal) version of Fell's theorem shows why the 'dilemma for realism'
mentioned in section 4.1 is nothing of the kind as far as IR-inequivalent representations
are concerned:

Classical Fell 'theorem': the spatial average of a classical quantity over an 
infinite universe cannot be empirically determined by measurements confined 
to a finite spatial region.

4.6 Retrospective

From a realist perspective the sting of the representation ambiguity has largely been
drawn. Locally, any representation ambiguity is artificial, caused by the presence of un- 
physical degrees of freedom beyond the high-energy limit of the theory's validity. Globally, 
there may indeed be representation ambiguities — depending on cosmology, and the topol- 
ogy of the universe — but the inaccessible information which they encode is 'respectable', 
analogous to the classical inaccessibility of the long-distance structure of the universe. 

Nonetheless it would be more aesthetic to lose the long-distance ambiguity as well. 
This can only be done if the universe is finite. There has long been a divide between 
observational cosmology (favouring an open universe) and theory (preferring closed uni- 
verses); this analysis of representations makes a modest contribution to keeping the divide 
open. 

For the purposes of this paper, representational ambiguities need no longer concern 
us. In practice we are always analysing a theory in a finite region — and idealizing the 
system beyond that region in whatever manner is convenient — so different choices of 
representation should not affect our conclusions. 

5 Localisation in quantum field theory 

The phenomenology of particle physics — and indeed, of virtually all of science — makes 
extensive use of the concept of localisation: that is, of the concept that physical systems 
have at least some states which are localised in finite spatial regions. There are a number
of results in AQFT which apparently rule out the possibility of such states, and this is 
sometimes described as a paradox in QFT. In this section the problem of localisation 
will be analysed from the viewpoint of the previous sections, and it will be argued that 
nothing paradoxical is going on. 

In a relativistic theory, there are two natural ways of thinking about localisation:
spatial localisation, where something is localised in some subregion $\Sigma_i$ of a spacelike slice
$\Sigma$ (and therefore not localised anywhere else in $\Sigma$), and spacetime localisation, where
something is localised in some subregion O of spacetime (and therefore not localised in
any region of spacetime spacelike separated from O). The latter concept makes sense in
AQFT (including the scale-$l$ approximate AQFTs of section 3.3); the former needs some
concrete QFT to be interpreted. In this section we will move freely between the two
notions, using a discretised QFT where necessary to make sense of spatial localisation,
and relying on renormalisation theory to ensure that we are not making cutoff-dependent
statements.

5.1 Localisation and the Reeh-Schlieder theorem



We have already established (in section 2) that in QFT the idea of localisation arises
through the spatial localisation of the field observables $\phi(\mathbf{x})$ and $\pi(\mathbf{x})$ (here, for
convenience, we work with a concrete element of the scale-$l$ equivalence class of the QFT in
question). But how are we to go from localised observables to localised states? We might
begin by trying the following (phrased in terms of spacetime localisation):

Naive localisation: A state $|\psi\rangle$ is localised in a region O iff $\langle\psi|A|\psi\rangle = 0$
for any observable A localised in a region spacelike separated from O.

This seems plausible when we compare to the classical case: there a state is localised
in O if $\pi(x) = \phi(x) = 0$ for any x spacelike separated from O. But it is mathematically
impossible for any states to satisfy it, for it implies that for any such $x = (\mathbf{x}, t)$, and for
any n,

$$\langle\psi|\phi^n(\mathbf{x},t)|\psi\rangle = \langle\psi|\pi^n(\mathbf{x},t)|\psi\rangle = 0. \tag{17}$$

But this would imply that $|\psi\rangle$ was a simultaneous eigenstate of $\pi(\mathbf{x})$ and $\phi(\mathbf{x})$, and these
operators have no eigenstates in common (the mathematics, once we have introduced a
cut-off to deal with operators defined at a point, is the same as for the nonrelativistic
operators X, P, which are well-known to have no eigenstates in common).
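
To spell out the step from (17) to the impossibility claim, the following sketch works in the discretised theory. The symbols $\phi_i$ and $\pi_i$ (the field and its conjugate momentum at a single grid point) are notation introduced here for illustration, and the canonical commutation relation is assumed in units with $\hbar = 1$; domain questions for unbounded operators are ignored.

```latex
\begin{align*}
\langle\psi|\phi_i^2|\psi\rangle = \|\phi_i|\psi\rangle\|^2 = 0
  &\;\Rightarrow\; \phi_i|\psi\rangle = 0, \\
\langle\psi|\pi_i^2|\psi\rangle = \|\pi_i|\psi\rangle\|^2 = 0
  &\;\Rightarrow\; \pi_i|\psi\rangle = 0, \\
\text{hence}\quad [\phi_i,\pi_i]\,|\psi\rangle = 0,
  &\quad\text{whereas}\quad [\phi_i,\pi_i]\,|\psi\rangle = i\,|\psi\rangle \neq 0 .
\end{align*}
```

The $n = 2$ case of (17) alone is therefore already inconsistent with the commutation relation, exactly as for X and P.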

Physically it is easy to see what is happening here. The vacuum state of a field theory 
(which we will denote by $|\Omega\rangle$) is not 'nothingness', or 'empty space'; it is simply a slightly
colourful way of describing the ground state of the field's Hamiltonian. In solid-state
systems this state is just the zero-temperature state of the solid, in which the atoms will
not be at rest but will have zero-temperature fluctuations; the same will be true for the 
field excitations of a relativistic field theory. 

This suggests, however, an alternative: rather than consider states which are local 
simpliciter, we could consider states which differ only locally. The vacuum could be used 
as a reference state, and states could be declared 'local' if they differed only locally from 
the vacuum. 

To make this quantitative, we need to understand what it could mean for two quantum
states to 'differ locally'. Recall that in section 2.3 we defined spatially local subsystems
of the QFT, with Hilbert spaces $\mathcal{H}_{\Sigma_i}$, which described the degrees of freedom of the
QFT localised in $\Sigma_i$; this formally defined concept can be exactly defined in discretised
QFT, provided that the grid spacing is small compared to each $\Sigma_i$. If the vacuum were
a product state with respect to any decomposition of the form $\mathcal{H}_\Sigma = \mathcal{H}_{\Sigma_1} \otimes \mathcal{H}_{\Sigma_2}$ (where
$\Sigma = \Sigma_1 \cup \Sigma_2$ and $\Sigma_1 \cap \Sigma_2 = \emptyset$),[13] then defining 'differing locally' would be straightforward:
the vacuum would have form $|\Omega\rangle = |\Omega_1\rangle \otimes |\Omega_2\rangle$, and any state $|\psi\rangle$ could be said to differ
from the vacuum only in $\Sigma_1$ if (a) $|\psi\rangle$ was also a product state, $|\psi\rangle = |\psi_1\rangle \otimes |\psi_2\rangle$, and (b)
$|\psi_2\rangle = |\Omega_2\rangle$.

13 Since we are working with discretised QFT here, a more accurate statement would be 'all grid points
in $\Sigma$ are contained either in $\Sigma_1$ or $\Sigma_2$, and $\Sigma_1$ and $\Sigma_2$ do not overlap.'

However, the vacuum is not a product state, but a highly entangled one (this can be
readily seen by calculating expectation values such as

$$\langle\Omega|\phi(x)\phi(y)|\Omega\rangle - \langle\Omega|\phi(x)|\Omega\rangle\langle\Omega|\phi(y)|\Omega\rangle \tag{18}$$

and showing that they are non-zero for all spacelike separated x, y,[14] and is discussed
by Clifton and Halvorson (2001).) As such, defining 'differing locally' is somewhat more
subtle, since it is difficult and even controversial to say what the state of a subsystem
of a quantum system is, when the total state of the system is entangled. Some possible
definitions might be:

1. Two states might be said to differ only within $\Sigma_1$ if their expectation values coincide
with respect to all operators localised outside $\Sigma_1$.

2. We might take the 'state of a subsystem' to be the density operator obtained when
all degrees of freedom of the other subsystems are traced over. In this case, two
states $|\psi_a\rangle$ and $|\psi_b\rangle$ could be said to differ only within $\Sigma_1$ if (with $\mathrm{Tr}_{\Sigma_1}$ being
tracing over degrees of freedom inside $\Sigma_1$)

$$\mathrm{Tr}_{\Sigma_1}|\psi_a\rangle\langle\psi_a| = \mathrm{Tr}_{\Sigma_1}|\psi_b\rangle\langle\psi_b|. \tag{19}$$



3. The problem with the above definition of the state of a subsystem is that we cannot
then recover the state of the whole system from the states of its components. This
could be taken to indicate the intrinsic nonlocality of quantum states; this view
was recently rejected by Deutsch and Hayden (2000), who propose that locality in
quantum mechanics is best understood in the Heisenberg picture. They pick a
specific Hilbert-space vector as a reference state, to be used to calculate all expectation
values irrespective of the particular initial conditions, and then treat the quantum
state as being specified at time t by the set of all Heisenberg operators at time t.
The obvious definition of 'differing locally' within this framework is that two states
differ only within $\Sigma_1$ if they have the same Heisenberg operators outside $\Sigma_1$.

4. Leaving aside the question of how to specify the state of a subsystem, we might
define two states as differing only within $\Sigma_1$ if they are connected by a unitary
operator localised within $\Sigma_1$ (such an operator will have form $U_{\Sigma_1} \otimes 1_{\Sigma_2}$ acting on
the tensor-product space $\mathcal{H}_\Sigma = \mathcal{H}_{\Sigma_1} \otimes \mathcal{H}_{\Sigma_2}$).

14 For a free scalar field, with $x = (\mathbf{x}, t)$ and $y = (\mathbf{y}, t)$ we get

$$\langle\Omega|\phi(x)\phi(y)|\Omega\rangle - \langle\Omega|\phi(x)|\Omega\rangle\langle\Omega|\phi(y)|\Omega\rangle = \tfrac{1}{2}(m^2 - \nabla^2)^{-1/2}\delta(\mathbf{x} - \mathbf{y});$$

smearing $\phi(\mathbf{x}, t)$ and $\phi(\mathbf{y}, t)$ out with non-overlapping spatial test-functions f, g, this becomes
$\tfrac{1}{2}\int_\Sigma d^3x\, f(\mathbf{x})[(m^2 - \nabla^2)^{-1/2} g](\mathbf{x})$. That this is nonzero for generic f, g, irrespective of the spatial
separation of their supports, follows from the antilocality of the operator $(m^2 - \nabla^2)^{-1/2}$ (Segal 1964). Adding
interactions to the free field will lead to perturbative modifications to these results but should not change
their qualitative nature, provided the interactions are weak enough at large distance-scales to treat
perturbatively; hence adding a $\lambda\phi^4$ interaction will have only quantitative effect, but adding a coupling
to a non-Abelian gauge field may have more drastic consequences.

Fortunately, all of these definitions coincide,[15] and this in turn supports the naturalness
of each of them as a definition of 'differing locally'. Using the vacuum as our reference
state, and using for convenience the first definition, we obtain our definition of localised
states, which we refer to as Knight localisation after Knight (1961), who first proposed
such a definition. We state it in the more general context of AQFT (for which reason we
return to spacetime, as opposed to spatial, localisation):

Knight localisation: a state $|\psi\rangle$ is localised in a region O iff
$\langle\psi|A|\psi\rangle - \langle\Omega|A|\Omega\rangle = 0$ for any observable A localised in a region spacelike separated
from O.
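
The agreement between definitions (2) and (4) can be checked directly in a toy model. The sketch below is illustrative only: it assumes a two-qubit system standing in for the two regions, an arbitrarily chosen entangled reference state playing the role of the vacuum, and an arbitrarily chosen rotation acting on the first qubit alone. The helper names `apply_on_first` and `reduced_state_of_second` are hypothetical.

```python
import math

def apply_on_first(u, psi):
    """Apply the 2x2 matrix u to qubit 1 of a two-qubit state.

    psi lists the amplitudes of |00>, |01>, |10>, |11>, so the
    amplitude of |a b> sits at index 2*a + b.
    """
    return [sum(u[a_out][a] * psi[2 * a + b] for a in range(2))
            for a_out in range(2) for b in range(2)]

def reduced_state_of_second(psi):
    """Density matrix of qubit 2, obtained by tracing over qubit 1."""
    return [[sum(psi[2 * a + b] * complex(psi[2 * a + b2]).conjugate()
                 for a in range(2))
             for b2 in range(2)] for b in range(2)]

# Assumed entangled reference state: cos(0.3)|00> + sin(0.3)|11>.
reference = [math.cos(0.3), 0.0, 0.0, math.sin(0.3)]

# Assumed local unitary on qubit 1: a real rotation by 1.1 radians.
cos_t, sin_t = math.cos(1.1), math.sin(1.1)
rotation = [[cos_t, -sin_t], [sin_t, cos_t]]

rotated = apply_on_first(rotation, reference)

rho_before = reduced_state_of_second(reference)
rho_after = reduced_state_of_second(rotated)

# Largest entry-wise difference between the two reduced states of qubit 2.
max_difference = max(abs(rho_before[b][b2] - rho_after[b][b2])
                     for b in range(2) for b2 in range(2))

# Overlap of the two global states, confirming that they really are different.
overlap = abs(sum(x * y for x, y in zip(reference, rotated)))

print(max_difference, overlap)
```

The two global states are distinct, yet the state of the second qubit obtained by tracing over the first is unchanged, which is the sense in which the states 'differ only within' the first subsystem.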

However, Knight localisation differs in one important respect from the sort of
localisation which we encounter in NRQM. In the latter, properties like 'is localised in O' are
treatable in the same way as properties like 'has energy E' or 'has momentum less than p':
that is, we can define a projection operator whose intended interpretation is 'localised in
O', whose range is the space of all such states. This would be possible for Knight-localised
states iff they form a subspace: that is, iff any superposition of two states Knight-localised
in O is also Knight-localised in O.

The fact that Knight-localised states do not have this property is a consequence of the
Reeh-Schlieder theorem (Reeh and Schlieder 1961).

Reeh-Schlieder theorem: for any region O, the set of vectors $\mathcal{A}(O)|\Omega\rangle$
generated by the action of operators localised within O upon the vacuum,
spans the Hilbert space of the QFT.

(For a proof, and further discussion, see Haag 1996.) It follows[16] from the Reeh-Schlieder
theorem that states Knight-localised at O span the entire state space, which rules out
any possibility of a projector meaning 'localised with certainty in O'.



15 The equivalence of (1), (2) and (4) is trivial; the equivalence of (3) and (4) is given in Deutsch and
Hayden (2000) (and is, again, trivial once Deutsch and Hayden's formalism is understood).



16 To see that it follows, we need only note that the unitary elements of a (bounded) operator algebra
$\mathcal{A}(O)$ span $\mathcal{A}(O)$. This can be proved as follows: for any bounded Hermitian element H of $\mathcal{A}(O)$, and
any $t \neq 0$, $(it)^{-1}(\exp(itH) - 1)$ is a linear combination of unitary elements of $\mathcal{A}(O)$. As $t \to 0$, this
sequence tends to H, hence H is in the span of the unitary operators. To complete the proof, simply
recall that any linear operator can be written as A + iB, where A and B are Hermitian.

It has long been understood that the Reeh-Schlieder theorem is a consequence of the
entanglement of the QFT vacuum. Non-relativistic quantum mechanics furnishes us with
many examples where, given an entangled state of the Hilbert space $\mathcal{H}_A \otimes \mathcal{H}_B$, unitary
operations on $\mathcal{H}_A$ alone suffice to produce a basis for $\mathcal{H}_A \otimes \mathcal{H}_B$. One example, which
plays an important role in quantum teleportation (Bennett, Brassard, Jozsa, Peres, and
Wootters 1993), is the Bell basis: we take a system of two qubits (i.e., two-state systems)
and prepare them in one of these four states

$$\begin{aligned}
|B_1\rangle &= \tfrac{1}{\sqrt{2}}\left(|0\rangle \otimes |0\rangle + |1\rangle \otimes |1\rangle\right) \\
|B_2\rangle &= \tfrac{1}{\sqrt{2}}\left(|0\rangle \otimes |0\rangle - |1\rangle \otimes |1\rangle\right) \\
|B_3\rangle &= \tfrac{1}{\sqrt{2}}\left(|1\rangle \otimes |0\rangle + |0\rangle \otimes |1\rangle\right) \\
|B_4\rangle &= \tfrac{1}{\sqrt{2}}\left(|1\rangle \otimes |0\rangle - |0\rangle \otimes |1\rangle\right)
\end{aligned} \tag{20}$$

Obviously this is a basis for the combined two-qubit system, but also any element of it is
cyclic under unitary operations carried out on the first qubit alone, as is easily shown:

$$\begin{aligned}
(M_A \otimes 1)\,|B_1\rangle &= |B_2\rangle \\
(M_B \otimes 1)\,|B_1\rangle &= |B_3\rangle \\
(M_C \otimes 1)\,|B_1\rangle &= |B_4\rangle
\end{aligned} \tag{21}$$

where

$$M_A = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}, \qquad
M_B = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \qquad
M_C = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$$

in the $(|0\rangle, |1\rangle)$ basis. In other words, even if the qubits are macroscopically separated,
if they begin in an appropriately entangled state then it is possible by operating on one
qubit to generate a set of states which span the Hilbert space. (Examples of this kind are
analysed in rather more detail by Redhead (1995) and Clifton and Halvorson (2001).)
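
The claim in (21) is easy to verify by hand, but a short numerical check may be useful. The sketch below assumes the amplitude ordering |00>, |01>, |10>, |11> (first label = first qubit) and takes the three matrices to be the ones fixed by (21) itself; the helper names are hypothetical.

```python
import math

S = 1.0 / math.sqrt(2.0)

# Amplitudes of |00>, |01>, |10>, |11>; the first label is the first qubit.
B1 = [S, 0.0, 0.0, S]
B2 = [S, 0.0, 0.0, -S]
B3 = [0.0, S, S, 0.0]
B4 = [0.0, -S, S, 0.0]

# Single-qubit unitaries in the (|0>, |1>) basis, as determined by (21).
M_A = [[1, 0], [0, -1]]
M_B = [[0, 1], [1, 0]]
M_C = [[0, -1], [1, 0]]

def apply_on_first(m, psi):
    """Apply the 2x2 matrix m to the first qubit, leaving the second alone."""
    return [sum(m[a_out][a] * psi[2 * a + b] for a in range(2))
            for a_out in range(2) for b in range(2)]

def inner(u, v):
    """Inner product; all amplitudes here are real."""
    return sum(x * y for x, y in zip(u, v))

def close(u, v, tol=1e-12):
    return all(abs(x - y) <= tol for x, y in zip(u, v))

# States reached from B1 by acting on the first qubit only.
generated = [B1,
             apply_on_first(M_A, B1),
             apply_on_first(M_B, B1),
             apply_on_first(M_C, B1)]

matches = [close(g, b) for g, b in zip(generated, [B1, B2, B3, B4])]
gram = [[inner(u, v) for v in generated] for u in generated]

print(matches)
```

Every entry of `matches` is true, and `gram` is the identity matrix up to rounding: the four states reached by acting on the first qubit alone are orthonormal and hence span the two-qubit Hilbert space.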



Since (as discussed above) the QFT vacuum is entangled, the Reeh-Schlieder theorem 
should come as no surprise to us. Nor does it cause any logical problems for Knight's 
definition of localisation: there are many perfectly respectable properties of quantum
states which are not preserved under linear superposition, such as being an eigenstate of 
energy, or being an entangled state. 

It does, however, cause us practical problems. The sort of localisation which we 
use in NRQM is preserved under linear superpositions, and this fact is essential to the 
analysis of NRQM problems — so, at least on the face of it, NRQM localisation and 
Knight localisation must be different concepts. Nor is the problem confined to NRQM:
in scattering theory, for instance, it is crucial to be able to discuss the amplitude for 
a particle to be scattered to a given area, and such discussions presuppose that spatial 
localisation is preserved under superpositions. 

So what is going on? In the face of the Reeh-Schlieder theorem, there is little prospect
of finding an alternative to Knight localisation which does have the required properties
(see Halvorson and Clifton (2001) for a wide variety of no-go theorems); so, how are
NRQM, and scattering phenomenology, compatible with QFT?

The key to this question is the fact that NRQM is not supposed to be perfectly
compatible with QFT: rather, it is supposed to be an approximation to QFT valid only in
certain regimes of QFT. (And the same is true for scattering theory, which is a marvellous
tool to analyse high-energy collisions, but of limited use in understanding, e.g., quark
confinement.) With this in mind, we will lower our sights and seek an approximate form
of locality which, in certain domains of QFT, will be approximately preserved under
linear superposition. Section 5.2 will construct such an approximation, and section 5.3 will
address the question of whether it is after all adequate for our needs.



5.2 Effective localisation 

Our approximate concept of localisation will be defined as follows (it is again defined in
terms of spatial localisation, partly for ease of comparison with nonrelativistic
localisation):

1. Effective localisation (qualitative form): A state $|\psi\rangle$ is effectively
localised in a spatial region $\Sigma_i$ iff for any function f of field operators $\phi$, $\pi$,
$\langle\psi|f|\psi\rangle - \langle\Omega|f|\Omega\rangle$ is negligibly small when f is evaluated for field operators
outside $\Sigma_i$, compared to its values when evaluated for field operators within
$\Sigma_i$.

2. The effective localisation principle (ELP) (qualitative form): A
subspace $\mathcal{H}$ of the QFT Hilbert space $\mathcal{H}_\Sigma$ obeys the ELP on scale L iff for any
spatial region S large compared with L, a superposition of states effectively
localised in S is effectively localised in effectively the same region.

These qualitative notions can be made precise in a number of ways, such as:

1. Effective localisation (quantitative form): A state $|\psi\rangle$ is L-localised in a
region $\Sigma_i$ iff for any function f of field operators $\phi$, $\pi$, $\langle\psi|f|\psi\rangle - \langle\Omega|f|\Omega\rangle$ falls
off for large d like (or faster than) $\exp(-d/L)$, where d is the distance from
$\Sigma_i$ at which the function f is evaluated. (Note that there is no difference,
according to this definition, between a state L-localised at some spatial point
x and a state L-localised in a region of size ~ L around x.)

2. ELP (quantitative form): A subspace obeys the ELP on scale L iff, for
any 3-sphere S of radius > L, a superposition of states L-localised in S is
L-localised in S.

Obviously ELP cannot hold on any scale for the Hilbert space $\mathcal{H}_\Sigma$ of the full QFT,
on pain of violating the Reeh-Schlieder theorem (the space of states effectively localised
in a region obviously includes all those Knight-localised in a region, and as mentioned in
section 5.1, the span of all such states is the whole of $\mathcal{H}_\Sigma$). However, if some subspace $\mathcal{H}$
of $\mathcal{H}_\Sigma$ is such that:

1. ELP holds, for all regions large compared with some lengthscale L;

2. the current state of the QFT lies within $\mathcal{H}$; and

3. $\mathcal{H}$ is approximately conserved, on timescales of interest to us, by the QFT dynamics

then we will, approximately, be able to define projection operators in $\mathcal{H}$ which project
onto states localised in a given region, provided that that region is large compared to L.
(Note that L is nothing to do with the cutoff lengthscale l, except that L must be large
relative to l for our results to be cutoff-independent.)

Why should we expect ELP to hold for any subspaces of $\mathcal{H}_\Sigma$? To answer this, we
need to analyse the entanglement of the vacuum in somewhat more detail. For if it were
completely non-entangled, ELP would indeed hold for all of $\mathcal{H}_\Sigma$, as argued for above.
We might expect, then, that if the vacuum entanglement drops off rapidly above certain
distance-scales, then above those scales localisation would be 'almost' preserved under
superposition, and only extremely careful superposition of states would spoil this; hence,
we would expect it to be fairly typical for a given subspace of $\mathcal{H}_\Sigma$ to satisfy ELP on these
lengthscales.

Does vacuum entanglement in fact decrease with distance; equivalently, do the cor- 
relations in the vacuum decrease with distance? For a massive QFT it can be shown 
that they drop off exponentially, with the lengthscale given by the 'Compton wavelength' 
1/m (where m is the renormalised mass term in the QFT Hamiltonian) : this is provable 
rigorously for any QFT satisfying the AQFT axioms (Fredenhagen 1985), and can be
calculated perturbatively in the case of QFTs with weak large-scale interaction terms. 
(QCD, and other theories with strong long-distance interactions, may not be covered by 
either case — rather little is known about the QCD vacuum, in fact.) 

To see intuitively the significance of the 1/m lengthscale, recall that the Hamiltonian
for a free scalar QFT has the form

$$\hat H \Psi[\chi] = \frac{1}{2}\int d^3x \left( -\frac{\delta^2}{\delta\chi(\mathbf{x})^2} + (\nabla\chi)(\mathbf{x})^2 + m^2\chi(\mathbf{x})^2 \right)\Psi[\chi], \tag{22}$$

and that in a discretised QFT we replace derivative terms like $\nabla\phi(\mathbf{x})$ with terms like
$(\phi(\mathbf{x}) - \phi(\mathbf{x} + \mathbf{l}))/l$, where $|\mathbf{l}| = l$ and l is the cutoff length. These discretised derivative
terms are the only ones in the Hamiltonian which lead to entanglement, as they are the
only ones which couple observables at different lattice points.

Now there is no requirement that $l \ll 1/m$: all we require of l is that it is small
compared with the lengthscales at which we want to study the QFT's structure. Admittedly,
if l is smaller than, or comparable to, 1/m, then the coupling terms $(\phi(\mathbf{x}) - \phi(\mathbf{x} + \mathbf{l}))/l$,
which are responsible for entanglement, will be significant relative to the other terms in
the Hamiltonian. However, as l becomes large compared with 1/m, the coupling terms
become only a small perturbation on the Hamiltonian

$$\hat H \Psi[\chi] = \frac{1}{2}\int d^3x \left( -\frac{\delta^2}{\delta\chi(\mathbf{x})^2} + m^2\chi(\mathbf{x})^2 \right)\Psi[\chi], \tag{23}$$

which is the Hamiltonian of a set of uncoupled oscillators, one at each lattice point. Thus,
on such scales we should expect to find the ground state only slightly entangled.
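
This heuristic can be illustrated numerically. The sketch below is a simplification, not the model of the text: it assumes a one-dimensional periodic chain of coupled oscillators (rather than a three-dimensional lattice), units with $\hbar = 1$, and the standard ground-state two-point function of such a chain. The function names `vacuum_correlation` and `neighbour_ratio` are hypothetical.

```python
import math

def vacuum_correlation(d, mass, spacing=1.0, sites=64):
    """Ground-state correlation <phi_0 phi_d> for a periodic 1D chain.

    The chain has H = 1/2 * sum_i [pi_i^2 + (phi_i - phi_{i+1})^2 / spacing^2
    + mass^2 * phi_i^2], whose normal modes have frequencies
    omega_k^2 = mass^2 + (4 / spacing^2) * sin^2(pi * k / sites).
    Requires mass > 0 so that the k = 0 mode has nonzero frequency.
    """
    total = 0.0
    for k in range(sites):
        omega = math.sqrt(mass ** 2
                          + (4.0 / spacing ** 2) * math.sin(math.pi * k / sites) ** 2)
        total += math.cos(2.0 * math.pi * k * d / sites) / (2.0 * omega)
    return total / sites

def neighbour_ratio(mass, spacing=1.0):
    """Nearest-neighbour correlation relative to the on-site fluctuation."""
    return vacuum_correlation(1, mass, spacing) / vacuum_correlation(0, mass, spacing)

# With spacing = 1 the argument is the product (mass * spacing), i.e. l / (1/m).
for m_times_l in (0.5, 1.0, 2.0, 5.0):
    print(m_times_l, neighbour_ratio(m_times_l))
```

In this toy chain the neighbour-to-neighbour vacuum correlation is a sizeable fraction of the on-site fluctuation when the spacing is comparable to 1/m, and falls to a few per cent once the spacing is several times 1/m, in line with the perturbative argument above.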

So, we expect that for a massive QFT with mass m, we should find it to be common 
for the ELP to hold for subspaces on lengthscales large relative to 1/m. Obviously, this 
heuristic argument has to be tested for any specific subspace in which we are interested.
In particular, we find it to hold for the one-particle sector of any free (or asymptotically
free) massive bosonic QFT, suggesting that such theories have a quite satisfactory notion
of particle localisation (this result is proved, and analysed in extenso, in Wallace (2001a)).



For a QFT which describes some massless fields, the vacuum correlation functions still 
drop off with distance, but follow a power-law dropoff rather than an exponential one; 
hence there is no characteristic lengthscale on which correlations exist. This does not rule 
out finding subspaces of the Hilbert space of the QFT for which ELP applies on some 
useful scale; however, it suggests that the scale will vary according to the specific problem 
being examined. 

Furthermore, we expect there to be physically relevant sectors of at least some
massless QFTs in which states are localised only on extremely large lengthscales (if at all).
For instance, there should be a low-energy sector of QED in which electron number is
approximately conserved, in which the electromagnetic field has effectively no independent
degrees of freedom, and in which each electron has an associated Coulomb field. In such
sectors, the state of the electromagnetic field is not really localised at all, or at least not
on any characteristic lengthscale; analysis of such sectors would be necessary to establish
the validity of ELP for nonrelativistic, interacting particles. (See chapter VI of Haag
(1996) and references therein for further discussion of the state space of QED, from the
perspective of AQFT.)



5.3 The conceptual justification of effective localisation 

We have now constructed a definition of localisation which approximately has the
properties we wish: that is, it designates a set of states which are approximately localised,
and that set is closed under linear superpositions. But is this approximate localisation
good enough? After all, in general 'effectively localised' states are not exactly localised:
particle states, in particular, invariably differ from the vacuum everywhere in space.
It follows that particle creation and annihilation operators cannot be exactly localised,
which seems to preclude localised particle-detecting devices.

This problem has recently been addressed from an AQFT viewpoint by Halvorson
and Clifton (2001) (see also Haag (1996) for a more technical discussion along similar
lines). Their argument is as follows:

1. What we actually measure are (exactly) local operators. 

2. Although particles cannot be detected by measurements of any local operator, they
can be detected by measurements of operators which are very close (in operator
norm) to local operators.

3. It follows that to a high degree of accuracy, we can detect particles; however, the
detection will not be 100% reliable.

In their approach, 

It is not (strictly speaking) true that we observe particles. Rather, there
are 'observation events', and these observation events are consistent (to a
good degree of accuracy) with the supposition that they are brought about by
(localizable) particles. (Halvorson and Clifton 2001)



From the perspective of this paper, the problem with this approach is its a priori assump- 
tion that what we measure are always exactly localised operators. This is, of course, an 
interpretive axiom of AQFT as it is often presented, but it effectively assumes the presence 
of outside observers whose measurements cannot be treated within the ordinary dynamics 
of the QFT. We shall instead construct an account which treats observers as part of the 
internal dynamics of the system (although, apart from that difference of emphasis, the
solution below will be rather similar in character to that of Halvorson and Clifton). 

If we wish instead to treat our QFT as a closed system, and the measurement process
as part of the internal dynamics of that system, then it is an open question whether or
not we must treat our measurements as localised. Furthermore, we have at least
some reason to think that the answer to the question is negative, for we believe our
measuring devices (including ourselves) to be made out of particles, and we have already
noted the fact that particles are never exactly localised.

But does it even make sense to consider non-localized measuring devices? To see that 
it does (at least in some sense), we return (again) to non-relativistic quantum mechanics. 
In NRQM it is unproblematic, at least in principle, to construct measurement devices 
out of atoms. Such devices usually have rather large masses, so despite the uncertainty
principle they can be put in states whose position and momentum are both very sharp.
In particular, if a measurement device's momentum is sharply peaked around zero then
the system will effectively remain in a single well-determined region. 

But will it exactly remain in that region? The mathematics says not, for it is math- 
ematically impossible for any centre-of-mass wavefunction to remain localised in a finite 
spatial region for any finite period of time. (A rather general proof of this result has been 
given by Hegerfeldt (1998a, 1998b), and is discussed by Halvorson and Clifton (2001).)
This is implicit, in fact, in the wave-functions we usually use in NRQM to describe 'lo- 
calised' systems: they are generally described by fairly sharp Gaussians, but any Gaussian 
— no matter how sharp — is non-zero everywhere in space. 

Does this mean that the Geiger counter in my lab has finite amplitude to be in An- 
dromeda? No, for the representation of a quantum object by a Gaussian state relies on 
the fact that the object is not entangled with its environment. But the part of the wave- 
function representing the device as (for instance) in the walls of the lab will obviously 
interact with its environment in a rather different way from the part which represents the 
device as in the middle of the lab. As such, the device should not strictly speaking be 
represented by a pure state at all, but is instead entangled with its surroundings. 

The reason why we do not in practice need to allow for this entanglement is that it
is ludicrously small. Consider, for instance, a device of mass m whose centre-of-mass
wave-function has Gaussian form

$$\psi(r) \propto \exp(-r^2/2L^2). \tag{24}$$

If the device is allowed to evolve freely, after some time t the centre-of-mass wave-function
will evolve to

$$\psi(r,t) \propto \exp(-r^2/2\tilde{L}^2) \tag{25}$$

where $\tilde{L}^2 = L^2 + it/2m$. The probability density for finding the particle a distance r from
the origin at time t is

$$|\psi(r,t)|^2 \propto \exp(-r^2/D^2) \tag{26}$$

where $D^2 = (L^4 + t^2/4m^2)/L^2$; hence the device's wave-function will have essentially
constant half-width as long as $t \ll 2mL^2$. For a one-kilogram device whose initial
half-width is $10^{-11}$ m, this means that the half-width will stay approximately constant on
timescales of order $10^6$ years. The probability of finding the particle in a given volume a
distance r from the origin, compared with the probability of finding it in a similar-sized
volume around the origin, is $|\psi(r,0)|^2/|\psi(0,0)|^2 = \exp(-r^2/L^2)$. If we take r = 1 μm,
then this probability ratio is approximately equal to $10^{-10^{10}}$. Only if we are working to
$10^{10}$-significant-figure accuracy do we need to allow for terms like this, and calculations
are seldom done to such accuracy!
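
The size of the Gaussian tail can be made concrete with a one-line calculation. The sketch below assumes the values r = 1 micrometre and L = 10^-11 m, and works with the base-10 logarithm of the ratio because the ratio itself is far too small to represent as a floating-point number; the function name `log10_tail_ratio` is hypothetical.

```python
import math

def log10_tail_ratio(r, width):
    """log10 of |psi(r,0)|^2 / |psi(0,0)|^2 = exp(-r^2 / width^2).

    Uses log10(exp(-x)) = -x / ln(10), which avoids floating-point underflow.
    """
    return -((r / width) ** 2) / math.log(10.0)

# Assumed values: r = 1 micrometre, L = 1e-11 metres.
exponent = log10_tail_ratio(r=1e-6, width=1e-11)
print(exponent)
```

The exponent comes out at about -4.3 x 10^9, so the probability ratio has several billion zeros after the decimal point, which is the order of magnitude of the double exponential quoted above.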

The truth, of course, is that the real state of any realistic model of NRQM is an 
entangled mess. It contains no exactly localised objects and no truly isolated subsystems,
and there is no chance of working out its exact dynamics. Our actual strategy is to find a
decomposition into subsystems such that they are approximately isolated, and to accept 
the errors that may result from this strategy. 

We can quantify this by means of the Hilbert-space norm. Let $|\psi\rangle$ be the actual state
of the system, and $U(t)$ the unitary operator generating its actual time evolution. Then
to say that the state is approximately two isolated subsystems (during a period of length
T) is to say that:

1. There is some state $|\tilde\psi\rangle = |\psi_1\rangle \otimes |\psi_2\rangle$ such that $|\langle\psi|\tilde\psi\rangle| \simeq 1$;

2. There is some unitary operator $\tilde{U}(t) = U_1(t) \otimes U_2(t)$ such that, for $t < T$,
$\tilde{U}(t)|\tilde\psi\rangle$ remains close in Hilbert-space norm to $U(t)|\psi\rangle$.

(Note that we are not requiring $\tilde{U}(t) \simeq U(t)$ in any sense; our discussion is entirely
state-dependent.)
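
As a toy illustration of the first criterion, read here as requiring the overlap between the actual state and some product state to be close to 1, the sketch below computes the best achievable overlap for a two-qubit state with real amplitudes. It relies on the standard fact that this maximum equals the largest singular value of the 2x2 matrix of amplitudes; the two-qubit setting, the example states, and the function name `max_product_overlap` are all assumptions made for illustration.

```python
import math

def max_product_overlap(c00, c01, c10, c11):
    """Largest overlap |<a (x) b|psi>| over all product states.

    psi = c00|00> + c01|01> + c10|10> + c11|11> with real, normalised
    amplitudes. The maximum is the largest singular value of the
    coefficient matrix [[c00, c01], [c10, c11]].
    """
    t = c00 ** 2 + c01 ** 2 + c10 ** 2 + c11 ** 2   # squared Frobenius norm
    det = c00 * c11 - c01 * c10                     # determinant
    disc = max(t * t - 4.0 * det * det, 0.0)        # guard against rounding
    return math.sqrt((t + math.sqrt(disc)) / 2.0)

# A weakly entangled state: cos(0.01)|00> + sin(0.01)|11>.
weak = max_product_overlap(math.cos(0.01), 0.0, 0.0, math.sin(0.01))

# A maximally entangled (Bell) state: (|00> + |11>) / sqrt(2).
s = 1.0 / math.sqrt(2.0)
bell = max_product_overlap(s, 0.0, 0.0, s)

print(weak, bell)
```

The weakly entangled state is within about one part in 10^4 of a product state, so the isolated-subsystem description is an excellent approximation for it; for the Bell state no product state achieves an overlap above about 0.71, and the approximation fails.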

If our standard description of a certain complex object (a DNA strand, a Geiger 
counter, a computer, whatever) describes it as an isolated object, and if isolated subsys- 
tems are only ever approximations to the 'true' quantum state, it might seem that we 
should say something like 'complex objects don't exist, they only approximately exist'. 
But this is to misunderstand the status of complex objects in physics. Such objects are 
actually identified by their structural properties (interacting with other DNA strands in a 
certain way, detecting radiation, calculating $\pi$, whatever) and such structural properties
can tolerate small amounts of noise. (Put another way, an object which operates like a 
computer with 99.999999 ... % accuracy is still a computer!) From this viewpoint, to say 
that a certain complex object is present is to say that making that approximation is an 
extremely effective method of analysing the dynamics. (See Wallace (2001b) for a more
extended discussion of this approach to complex objects.)

Localisation enters this framework as a pragmatic criterion for isolation. If, in the
state $|\tilde\psi\rangle = |\psi_1\rangle \otimes |\psi_2\rangle$, the states $|\psi_1\rangle$ and $|\psi_2\rangle$ have negligible amplitude to be in the
same spatial location, and if the actual dynamics (as described by $U(t)$) are generated
by a Hamiltonian in which the inter-particle forces drop off strongly with distance, then
we can apply the isolated-subsystem approximation with considerable confidence as to its
accuracy; conversely, if at some point this approximation predicts that $|\psi_1\rangle$ and $|\psi_2\rangle$ have
come to be localised, with high amplitude, in the vicinity of one another, then we should
abandon the approximation from that point onwards.

17 There are two simplifications in this account. Firstly, in general a state of NRQM contains large
numbers of identical particles, so the tensor product in (1) should be symmetrised and/or antisymmetrised
to allow for these particles. Secondly, many subsystems (particularly those of macroscopic dimension)
undergo decoherence, and hence cannot literally be said to be isolated in the sense used here. For
systems whose dynamics are regular (i.e. non-chaotic) this is not particularly significant: decoherence
will be significant if the system is not in some state of a certain preferred basis, but if it is in such a state
then the isolated quantum dynamics will fairly accurately predict its evolution. For chaotic systems we
cannot ignore decoherence in this manner (see Zurek (1998b) for details).

This means that (notwithstanding the existence in NRQM of a precisely definable
position operator) we should treat the spatial location of a given object as approximate.
It is not that my Geiger counter is localised in a number of exact locations with different
amplitudes for each; rather, it is determinately[18] localised in some approximately-defined
location. The exponentially small tails of the centre-of-mass wavefunction are totally
irrelevant to any considerations of the counter's spatial location, for it is only in the
approximation where these are neglected that we can treat the Geiger counter as isolated
anyway.

Before applying these ideas to QFT, we should acknowledge a lacuna in the argument: 
we have assumed that the Hilbert-space norm is the appropriate measure of closeness of 
approximation, but what justifies this assumption? From a purely mathematical point 
of view, to be sure, it is an extremely obvious choice: it is a very natural metric on 
the space of states, and the unitarity of Hilbert-space dynamics means that the 
Hilbert-space distance between two points is time-invariant. But the situation is more 
complicated than this: Reutsche (1998) (writing in the context of the modal interpretation 
of quantum mechanics) points out that whilst a state like 

$$\alpha\,|\text{Spin up}\rangle + \beta\,|\text{Spin down}\rangle \qquad (27)$$

(where $\alpha \sim 1$ and $\beta \sim 0$) may legitimately be regarded as 'almost' a spin-up state, it is 
highly problematic to suppose that a macroscopic superposition like 

$$\alpha\,|\text{Living cat}\rangle + \beta\,|\text{Dead cat}\rangle \qquad (28)$$

is almost a live cat, whatever the values of $\alpha$ and $\beta$. She describes this sort of 
approximation (borrowing her terminology from Teller 1984) as "ontological distortion", as 
against the mere "numerical distortion" that occurs when, say, the position of a classical 
particle is predicted with slight inaccuracy: a macroscopic superposition is not (prima 
facie) even approximately the same as a system in a macroscopically definite state, but 
an ontologically new and problematic entity. 

18 No comment on the measurement problem is intended here; Everettians should read this as 
'determinately localised relative to the branch which we are considering'. 

It is now clear that justifying the use of the Hilbert-space norm in this way takes us into 
the murky waters of the quantum measurement problem. We can see this another way by 
considering how easily an instrumentalist could dismiss our worries: from his perspective, 
the empirical content of the Hilbert-space norm is simply given by the probabilities of 
measurements to yield certain results, and states differing only very slightly in Hilbert 
space will give only slightly different probabilities for a given outcome on measurement. It 
is the illegitimacy of using "measurement" as a primitive concept in our approach which 
causes the difficulties. 
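
The instrumentalist's observation can be made quantitative. The following is a sketch only: the symbols $\psi$, $\phi$ and $P$ are introduced here for illustration and are not part of the original argument. For normalised states $\psi$ and $\phi$, and any projector $P$ representing a measurement outcome, the two outcome probabilities differ by at most twice the Hilbert-space distance between the states:

```latex
\begin{align*}
\bigl|\langle\psi|P|\psi\rangle - \langle\phi|P|\phi\rangle\bigr|
  &= \bigl|\langle\psi|P|\psi-\phi\rangle + \langle\psi-\phi|P|\phi\rangle\bigr| \\
  &\le \|P\psi\|\,\|\psi-\phi\| + \|\psi-\phi\|\,\|P\phi\| \\
  &\le 2\,\|\psi-\phi\| .
\end{align*}
```

The second line is the Cauchy-Schwarz inequality, and the third uses $\|P\psi\| \le 1$ and $\|P\phi\| \le 1$. The bound controls only the numerical difference in probabilities; it does nothing to license treating "measurement" as a primitive concept.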

In fact (staying in the non-relativistic domain) our problem is very closely related to 
some standard foundational questions in a number of interpretations: 



• In the modal interpretation (van Fraassen 1991; Dieks and Vermaas 1998) the question 
is essentially equivalent to the Albert-Loewer (Albert and Loewer 1990, 1991; 
Albert 1992) problem of vagueness, discussed by Reutsche (1998) (see also references 
therein): the idea of a modal interpretation is to pick out a certain set of properties 
as definite, whilst it seems likely that the properties actually picked out are slightly 
different from the "right" ones, where 'slightly' means 'slightly in Hilbert-space 
norm'. The standard ('vague property') response to this is to replace precise properties 
(like spatial localisation) with a family of 'vague' properties, all very close in 
Hilbert-space norm to the 'precise' property under consideration.[19] 

• In collapse interpretations such as those of Ghirardi, Rimini, and Weber (1986) and 
Pearle (1989) the question is equivalent to the 'problem of tails' (Albert and Loewer 
1996): the collapse mechanism is generally chosen to cause the wave-function to 
be sharply peaked around a given spatial position but it is impossible to construct 
mechanisms which prevent the wave-function having exponential tails extending 
to infinity. The problem of tails can be addressed (Albert and Loewer 1996) by 
postulating some rule such as "a particle is localised in spatial region $\mathcal{R}$ if the squared 
modulus of the wave-function integrated over $\mathcal{R}$ exceeds $(1 - p)$", where $p$ is taken 
as small. There has been recent controversy (Lewis 1997; Ghirardi and Bassi 1999; 
Bassi and Ghirardi 1999; Clifton and Monton 1999, 2000) as to whether this rule 
should be understood as a metaphysical principle (which leads to difficulties both 
because of the arbitrariness of $p$ and because of alleged failures of the arithmetic of 
macroscopic objects) or, as advocated by Clifton and Monton (1999), as a mere façon 
de parler useful in describing the wave-function (in which case there is presumably 
need for a defence of treating the tails as irrelevant to observational predictions). 

• From the viewpoint of the Everett interpretation (at least in the form advocated by 
the author (Wallace 2001b; Wallace 2001d), which is essentially that developed by 
Zurek (most explicitly in Zurek 1998a) and Saunders (1995, 1996, 1998)) our problem 
is that of justifying the neglect of parts of the wave-function of extremely low weight. 
This is essentially the probability problem of Everett interpretations: what justifies 
our strategy of disregarding low-weight branches when making decisions? 

19 In fact, the considerations of this section suggest a more serious problem for the modal interpretation: 
the very concept of macroscopic (non-relativistic) subsystems is threatened by wave-function spreading. 
Reutsche (1998) discusses a library book, localised approximately in her office but with exponential 
tails giving it a nonzero amplitude to be anywhere in space, and claims that such a book "is in no way 
compatible with my ordinary notion of located in my office" (Reutsche 1998, p. 233; italics hers); but 
in fact, the interaction of those tails with the walls of the office will lead to the sort of rapid interaction 
that removes any prospect of distinguishing 'book' and 'wall' subsystems. 
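
The localisation rule quoted above in connection with the problem of tails can be written out explicitly. This is only a transcription of the verbal rule: $\mathcal{R}$ is the spatial region and $p$ the small threshold mentioned there, while the symbol $\psi$ for the single-particle wave-function and the name $P_{\mathcal{R}}$ for the integrated weight are notational assumptions introduced here.

```latex
\[
P_{\mathcal{R}}[\psi] \;=\; \int_{\mathcal{R}} |\psi(\mathbf{x})|^{2}\, d^{3}x ,
\qquad
\text{the particle is localised in } \mathcal{R}
\ \text{ if } \
P_{\mathcal{R}}[\psi] > 1 - p , \quad 0 < p \ll 1 .
\]
```

For a normalised wave-function the weight lying outside $\mathcal{R}$ is then $1 - P_{\mathcal{R}}[\psi] < p$, which is exactly the 'tail' that the rule instructs us to disregard.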

Although expanding on these rather brief comments would take us deep into interpretational 
questions and far beyond the scope of this paper, it seems likely that (at least 
for the interpretations above) a resolution of the quantum measurement problem would 
include a justification of the Hilbert-space norm as the appropriate measure of approxi- 
mation. This seems particularly certain in the Everett interpretation, which is probably 
the interpretational program most naturally extendible to QFT and most in keeping with 
the spirit of this paper. 

Given an interpretational justification of our use of the Hilbert-space norm, the con- 
cepts above carry through to QFT with only minor changes. The dynamical impossibility 
of constructing exactly isolated subsystems in NRQM becomes a kinematic impossibility 
in QFT if we consider the spatially localised subsystems $\mathcal{H}_{\Sigma_i}$, but for essentially the same 
reasons: interactions between those subsystems are so prevalent that the energy cost of 
constructing non-entangled states would take us out of the domain of validity of the QFT 
altogether. Also, in QFT it is even more apparent than in NRQM that objects can only 
be defined approximately unless they are isolated, for all that exist in QFT are the field 
observables, excitations of them, and patterns in those excitations, and if two patterns 
overlap and interact then there can be no exact criteria for individuating them. And in 
QFT there are no action-at-a-distance interactions, so spatial location is an even better 
criterion for isolation: two subsystems can be treated as isolated iff they are effectively 
localised in spatially separated regions. 

Consequently, the approximate criteria developed above for NRQM apply just as well 
in QFT. In particular, QFT particles (unlike NRQM particles, which are part of the basic 
formalism) are themselves just certain patterns of excitations in the fields. As such, the 
appropriate concept of localisation for them is approximate, and so perfectly adequately 
treated by the methods of section 5.7. If we say, for instance, that a particle is localised 
in (spatial) region $\Sigma_i$, and that the region in the vicinity of $\Sigma_i$ is empty, then what we 
mean is: 

• that the field state restricted to $\Sigma_i$ coincides almost exactly with the restriction to 
$\Sigma_i$ of some state in the one-particle Hilbert space $\mathcal{H}_{1p}$; 

• that in the region surrounding $\Sigma_i$ the field state almost coincides with the vacuum. 

These requirements are perfectly consistent with the actual field state being that of a 
particle (or of a particle together with some other excitations, created by operators strictly 
localised far from $\Sigma_i$); they are also consistent with the actual state itself being strictly 
(i.e., Knight-) localised in $\Sigma_i$. The point is, we neither know nor care: different ways 
of realising the requirements just mean different (but tiny) amounts of noise added 
to our highly-accurate description of the quantum state and of its dynamics. Ditto for 
particle detectors: there is no more need (and no more prospect) to regard them as exactly 
localised, provided that they are very well localised. If we model such a detector as made 
out of particles then that model will describe it as differing from the vacuum all the way 
to infinity, but it will also describe those differences as incredibly small beyond a certain 
very-well-defined, but not precisely defined, region. Then if the actual state of the QFT 
is almost the same as that of the model within that region, and is almost the same as the 
vacuum outside that region, then the criterion has been met for us to say that the region 
contains a detector.[20] 

6 Conclusion 

This completes our analysis of Lagrangian QFTs. We have argued that such QFTs can be 
made into perfectly well-defined quantum theories provided we take the high-energy cutoff 
absolutely seriously; that the multiple ways of doing this are not in conflict provided that 
we understand them as approximations to the structure of some deeper, as yet unknown 
theory; that the existence of inequivalent representations is not a problem; that a concept 
of localisation can be defined for such theories which is adequate to analyse at least 
some of the practical problems with which we are confronted; and that the inexactness 
inherent in that concept is neither unique to relativistic quantum mechanics, nor in any 
way problematic. 

If there is one underlying theme to the approach to QFT advocated in this paper, it 
is this: the sort of information which we are interested in getting from physical theories 
is structural information. The reason that states with isolated subsystems approximate 
real quantum states well (both kinematically and dynamically) is because the two have 
virtually the same structure; the reason that we can be indifferent as to which concrete 
realisation of a scale-l AQFT to use is that structurally they are all virtually identical 
(at least until we get to scales of order l, at which point we no longer believe that any of 
them give us valid structural information). 

The need to understand theories in structural terms is not restricted to QFT, but it is 
more starkly obvious there. It is not just that QFT is strictly speaking false, or that it is 
expected to be replaced someday by a deeper 'theory X'; that is true for virtually every 
physical theory we study. It is rather that, if Wilson's approach to renormalization is 
taken seriously, QFT only makes sense if we include in it some vestigial aspects of the 
very theory which we expect to replace it. 

20 We could, to be sure, define a detector as being present if the QFT state is almost the same as the 
model state within the localisation region, and exactly the same as the vacuum outside it; but what 
would be the justification for so doing? We don't have any reason to suppose that actual lab detectors 
are precisely localised in this way. 

From this viewpoint, we can see that Lagrangian QFT (as I have defended it) is not 
really in conflict with AQFT at all. Success in the AQFT program would leave us with 
a field theory exactly defined on all scales, and such a theory would be a perfectly valid 
choice for 'theory X': furthermore, even if we found such an exact QFT it would not 
prevent us from defining low-energy, 'effective' QFTs — which would not be well defined 
without a cutoff; nor, probably, would it obviate the need for these theories in describing 
certain low-energy limits of X. 

If AQFT has any rival programs, in fact, they are string theory and other theory-of- 
everything candidates. Success in any of these programs would, of course, revolutionise 
physics, but that success would scarcely change the current status of Lagrangian QFT: as 
an inherently approximate, but nonetheless extraordinarily powerful tool to analyse the 
deep structure of the world. 